consensus DNA sequence with the program "phrap" (Phil Green, University of Washington, Seattle,
Washington). The consensus sequence obtained therefrom is herein designated DNA55736. 

In light of an observed sequence homology between the DNA55736 consensus sequence and an EST
sequence encompassed within the Merck EST clone no. R88049, the Merck EST clone R88049 was purchased 
and the cDNA insert was obtained and sequenced. It was found that this insert encoded a full-length protein. 
The sequence of this cDNA insert is shown in Figure 126 and is herein designated as DNA57693-1424. 

Clone DNA57693-1424 contains a single open reading frame with an apparent translational initiation 
site at nucleotide positions 56-58 and ending at the stop codon at nucleotide positions 416-418 (Figure 126). The 
predicted polypeptide precursor is 120 amino acids long (Figure 127). The full-length PRO1056 protein shown
in Figure 127 has an estimated molecular weight of about 13,345 daltons and a pI of about 5.18. Analysis of
the full-length PRO1056 sequence shown in Figure 127 (SEQ ID NO:199) evidences the presence of the
following: a signal peptide from about amino acid 1 to about amino acid 18, a transmembrane domain from about
amino acid 39 to about amino acid 58, a potential N-glycosylation site from about amino acid 86 to about amino
acid 89, protein kinase C phosphorylation sites from about amino acid 36 to about amino acid 38 and from about
amino acid 58 to about amino acid 60, a tyrosine kinase phosphorylation site from about amino acid 25 to about
amino acid 32 and an amino acid sequence block having homology to channel forming colicin proteins from
about amino acid 24 to about amino acid 56. Clone DNA57693-1424 was deposited with the ATCC on June
23, 1998 and is assigned ATCC deposit no. 203008.
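The relationship between the stated codon positions and the stated precursor length can be checked with simple arithmetic. The following is a minimal sketch, not part of the original disclosure; the function name `orf_length_aa` is a hypothetical helper, and it assumes 1-based nucleotide numbering with the stop codon not counted as an amino acid.

```python
def orf_length_aa(start_codon_first_nt: int, stop_codon_first_nt: int) -> int:
    """Number of amino acids encoded by an open reading frame.

    Both arguments are 1-based positions of the FIRST nucleotide of the
    start codon and of the stop codon. The stop codon is not translated.
    (Hypothetical helper for illustration only.)
    """
    coding_nt = stop_codon_first_nt - start_codon_first_nt
    if coding_nt <= 0 or coding_nt % 3 != 0:
        raise ValueError("start and stop codons are not in the same reading frame")
    return coding_nt // 3


# DNA57693-1424: initiation site at 56-58, stop codon at 416-418.
print(orf_length_aa(56, 416))  # 120
```

With the positions given for DNA57693-1424 this yields 120, in agreement with the 120-amino-acid precursor described above.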

An analysis of the Dayhoff database (version 35.45 SwissProt 35), using a WU-BLAST-2 sequence 
alignment analysis of the full-length sequence shown in Figure 127 (SEQ ID NO:199), evidenced significant
homology between the PRO1056 amino acid sequence and the following Dayhoff sequences: PLM_HUMAN,
A40533, ATNG_HUMAN, A55571, ATNG_SHEEP, S31524, GEN13025, RIC_MOUSE, A48678 and
A10871_1.

EXAMPLE 54: Isolation of cDNA clones Encoding Human PRO826

Use of the signal sequence algorithm described in Example 3 above allowed identification of an EST
cluster sequence from the Incyte database, designated 47283. This EST cluster sequence was then compared
to a variety of expressed sequence tag (EST) databases which included public EST databases (e.g., GenBank)
and a proprietary EST DNA database (LIFESEQ®, Incyte Pharmaceuticals, Palo Alto, CA) to identify existing
homologies. The homology search was performed using the computer program BLAST or BLAST2 (Altschul
et al., Methods in Enzymology 266:460-480 (1996)). Those comparisons resulting in a BLAST score of 70 (or
in some cases 90) or greater that did not encode known proteins were clustered and assembled into a consensus
DNA sequence with the program "phrap" (Phil Green, University of Washington, Seattle, Washington). The
consensus sequence obtained therefrom is herein designated DNA56000.
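The selection step described above (keep comparisons scoring 70 or greater, or 90 or greater in some cases, that do not encode known proteins) can be sketched as a simple filter. This is an illustrative sketch only: the `Hit` record, its field names and the example identifiers are hypothetical, and it does not reproduce BLAST or phrap themselves.

```python
from typing import List, NamedTuple


class Hit(NamedTuple):
    """One EST comparison result (hypothetical record layout)."""
    est_id: str
    score: float
    encodes_known_protein: bool


def select_for_assembly(hits: List[Hit], min_score: float = 70.0) -> List[Hit]:
    """Keep hits at or above the score cutoff that do not encode known proteins."""
    return [h for h in hits if h.score >= min_score and not h.encodes_known_protein]


# Made-up example data, not taken from the source.
hits = [
    Hit("est_a", 95.0, False),
    Hit("est_b", 72.0, False),
    Hit("est_c", 120.0, True),   # excluded: encodes a known protein
    Hit("est_d", 55.0, False),   # excluded: below the cutoff
]

print([h.est_id for h in select_for_assembly(hits)])        # default cutoff of 70
print([h.est_id for h in select_for_assembly(hits, 90.0)])  # stricter cutoff of 90
```

The hits that pass the filter would then be handed to the assembly program to build the consensus sequence.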

In light of an observed sequence homology between the DNA56000 consensus sequence and an EST
sequence encompassed within the Merck EST clone no. W69233, the Merck EST clone W69233 was purchased
and the cDNA insert was obtained and sequenced. It was found that this insert encoded a full-length protein.
The sequence of this cDNA insert is shown in Figure 128 and is herein designated as DNA57694-1341.

Clone DNA57694-1341 contains a single open reading frame with an apparent translational initiation
site at nucleotide positions 13-15 and ending at the stop codon at nucleotide positions 310-312 (Figure 128). The
predicted polypeptide precursor is 99 amino acids long (Figure 129). The full-length PRO826 protein shown
in Figure 129 has an estimated molecular weight of about 11,050 daltons and a pI of about 7.47. Analysis of
the full-length PRO826 sequence shown in Figure 129 (SEQ ID NO:201) evidences the presence of the
following: a signal peptide from about amino acid 1 to about amino acid 22, potential N-myristoylation sites from
about amino acid 22 to about amino acid 27 and from about amino acid 90 to about amino acid 95 and an amino
acid sequence block having homology to peroxidase from about amino acid 16 to about amino acid 48. Clone
DNA57694-1341 was deposited with the ATCC on June 22, 1998 and is assigned ATCC deposit no. 203017.

An analysis of the Dayhoff database (version 35.45 SwissProt 35), using a WU-BLAST-2 sequence
alignment analysis of the full-length sequence shown in Figure 129 (SEQ ID NO:201), evidenced significant
homology between the PRO826 amino acid sequence and the following Dayhoff sequences: CCU12315_1,
SCU96108_6, CELF39F10_4 and HELTHELHO.

EXAMPLE 55: Isolation of cDNA clones Encoding Human PRO819

Use of the signal sequence algorithm described in Example 3 above allowed identification of an EST
cluster sequence from the Incyte database, designated 49605. This EST cluster sequence was then compared
to a variety of expressed sequence tag (EST) databases which included public EST databases (e.g., GenBank)
and a proprietary EST DNA database (LIFESEQ®, Incyte Pharmaceuticals, Palo Alto, CA) to identify existing
homologies. The homology search was performed using the computer program BLAST or BLAST2 (Altschul
et al., Methods in Enzymology 266:460-480 (1996)). Those comparisons resulting in a BLAST score of 70 (or
in some cases 90) or greater that did not encode known proteins were clustered and assembled into a consensus
DNA sequence with the program "phrap" (Phil Green, University of Washington, Seattle, Washington). The
consensus sequence obtained therefrom is herein designated DNA56015.

In light of an observed sequence homology between the DNA56015 consensus sequence and an EST
sequence encompassed within the Merck EST clone no. H65785, the Merck EST clone H65785 was purchased
and the cDNA insert was obtained and sequenced. It was found that this insert encoded a full-length protein.
The sequence of this cDNA insert is shown in Figure 130 and is herein designated as DNA57695-1340.

Clone DNA57695-1340 contains a single open reading frame with an apparent translational initiation
site at nucleotide positions 46-48 and ending at the stop codon at nucleotide positions 202-204 (Figure 130). The
predicted polypeptide precursor is 52 amino acids long (Figure 131). The full-length PRO819 protein shown
in Figure 131 has an estimated molecular weight of about 5,216 daltons and a pI of about 4.67. Analysis of the
full-length PRO819 sequence shown in Figure 131 (SEQ ID NO:203) evidences the presence of the following:
a signal peptide from about amino acid 1 to about amino acid 24, a potential N-myristoylation site from about
amino acid 2 to about amino acid 7 and a region having homology to immunoglobulin light chain from about
amino acid 5 to about amino acid 33. Clone DNA57695-1340 was deposited with the ATCC on June 23, 1998
and is assigned ATCC deposit no. 203006.

An analysis of the Dayhoff database (version 35.45 SwissProt 35), using a WU-BLAST2 sequence
alignment analysis of the full-length sequence shown in Figure 131 (SEQ ID NO:203), evidenced significant
homology between the PRO819 amino acid sequence and the following Dayhoff sequences: HSU03899_1,
HUMIGUTEBJ, VG28_HSVSA, AF031522_1, PAD1_YEAST and AF045484_1.

EXAMPLE 56: Isolation of cDNA Clones Encoding Human PRO1006

An initial candidate sequence from Incyte cluster sequence no. 45748 was identified using the signal 
algorithm process described in Example 3 above. This sequence was then aligned with a variety of public and 
Incyte EST sequences and a consensus sequence designated herein as DNA56036 was derived therefrom. 

In light of an observed sequence homology between the DNA56036 consensus sequence and an EST 
sequence encompassed within the Merck EST clone no. 489737, the Merck EST clone 489737 was purchased
and the cDNA insert was obtained and sequenced. It was found that this insert encoded a full-length protein. 
The sequence of this cDNA insert is shown in Figure 132. 

The entire nucleotide sequence of DNA57699-1412 is shown in Figure 132 (SEQ ID NO:204). Clone
DNA57699-1412 contains a single open reading frame with an apparent translational initiation site at nucleotide
positions 28-30 and ending at the stop codon at nucleotide positions 1204-1206 (Figure 132). The predicted
polypeptide precursor is 392 amino acids long (Figure 133). The full-length PRO1006 protein shown in Figure
133 has an estimated molecular weight of about 46,189 daltons and a pI of about 9.04. Clone DNA57699-1412
has been deposited with the ATCC. Regarding the sequence, it is understood that the deposited clone contains
the correct sequence, and the sequences provided herein are based on known sequencing techniques.

Analyzing the amino acid sequence of SEQ ID NO:205, the putative signal peptide is at about amino
acids 1-23 of SEQ ID NO:205. The N-glycosylation sites are at about amino acids 40-43, 53-56, 204-207 and
373-376 of SEQ ID NO:205. An N-myristoylation site is at about amino acids 273-278 of SEQ ID NO:205.
The corresponding nucleotides of these amino acid regions and others can be routinely determined given the
sequences provided herein.
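The mapping from an amino acid region to its corresponding nucleotides follows directly from the position of the initiation codon. The sketch below is illustrative only; `aa_range_to_nt` is a hypothetical helper name, and it assumes 1-based numbering with amino acid 1 encoded by the initiation codon at nucleotides 28-30, as stated for DNA57699-1412.

```python
from typing import Tuple


def aa_range_to_nt(first_aa: int, last_aa: int, orf_start_nt: int) -> Tuple[int, int]:
    """Map a 1-based, inclusive amino acid range to the 1-based, inclusive
    nucleotide range that encodes it. (Hypothetical helper for illustration.)

    orf_start_nt is the position of the first nucleotide of the start codon.
    """
    if first_aa < 1 or last_aa < first_aa:
        raise ValueError("invalid amino acid range")
    nt_first = orf_start_nt + 3 * (first_aa - 1)
    nt_last = orf_start_nt + 3 * last_aa - 1
    return nt_first, nt_last


ORF_START = 28  # initiation site at nucleotides 28-30 (from the text above)

print(aa_range_to_nt(1, 23, ORF_START))     # putative signal peptide -> (28, 96)
print(aa_range_to_nt(40, 43, ORF_START))    # first N-glycosylation site -> (145, 156)
print(aa_range_to_nt(392, 392, ORF_START))  # last residue -> (1201, 1203)
```

As a consistency check, the last of the 392 residues ends at nucleotide 1203, immediately before the stop codon at positions 1204-1206.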


EXAMPLE 57: Isolation of cDNA Clones Encoding Human PRO1112

Use of the signal sequence algorithm described in Example 3 above allowed identification of a specific
EST cluster sequence. This EST cluster sequence was then compared to a variety of expressed sequence tag
(EST) databases which included public EST databases (e.g., GenBank) and a proprietary EST DNA database
(LIFESEQ®, Incyte Pharmaceuticals, Palo Alto, CA) to identify existing homologies. The homology search was
performed using the computer program BLAST or BLAST2 (Altschul et al., Methods in Enzymology 266:460-480
(1996)). Those comparisons resulting in a BLAST score of 70 (or in some cases 90) or greater that did not
encode known proteins were clustered and assembled into a consensus DNA sequence with the program "phrap"
(Phil Green, University of Washington, Seattle, Washington). The consensus sequence obtained therefrom is
herein designated DNA56018.

In light of an observed sequence homology between the DNA56018 consensus sequence and an EST
sequence encompassed within the Merck EST clone no. AA223546, the Merck EST clone AA223546 was
purchased and the cDNA insert was obtained and sequenced. It was found that this insert encoded a full-length
protein. The sequence of this cDNA insert is shown in Figure 134 and is herein designated as DNA57702-1476.

The entire nucleotide sequence of DNA57702-1476 is shown in Figure 134 (SEQ ID NO:206). Clone 
DNA57702-1476 contains a single open reading frame with an apparent translational initiation site at nucleotide 
positions 20-22 and ending at the stop codon at nucleotide positions 806-808 of SEQ ID NO:206 (Figure 134). 
The predicted polypeptide precursor is 262 amino acids long (Figure 135). The full-length PRO1112 protein
shown in Figure 135 has an estimated molecular weight of about 29,379 daltons and a pI of about 8.93. Figure
135 also shows the approximate locations of the signal peptide and transmembrane domains. Clone
DNA57702-1476 was deposited with the ATCC on June 9, 1998. It is understood that the deposited clone has the actual
nucleic acid sequence and that the sequences provided herein are based on known sequencing techniques.

Analysis of the amino acid sequence of the full-length PRO1112 polypeptide suggests that it possesses
some sequence similarity to other proteins. More specifically, an analysis of the Dayhoff database (version
35.45 SwissProt 35) evidenced some sequence identity between the PRO1112 amino acid sequence and at least
the following Dayhoff sequences: MTY20B11_13 (a Mycobacterium tuberculosis peptide), F64471,
AE000690_6, XLU16364_1, E43259 (H+-transporting ATP synthase) and PIGSLADRXEJ (MHC class II
histocompatibility antigen).

EXAMPLE 58: Isolation of cDNA clones Encoding Human PRO1074

Use of the signal sequence algorithm described in Example 3 above allowed identification of a single
Incyte EST cluster sequence (Incyte cluster sequence No. 42586). This cluster sequence was then compared to
a variety of expressed sequence tag (EST) databases which included public EST databases (e.g., GenBank) and
a proprietary EST DNA database (LIFESEQ™, Incyte Pharmaceuticals, Palo Alto, CA) to identify existing
homologies. The homology search was performed using the computer program BLAST or BLAST2 (Altschul
et al., Methods in Enzymology 266:460-480 (1996)). Those comparisons resulting in a BLAST score of 70 (or
in some cases 90) or greater that did not encode known proteins were clustered and assembled into a consensus
DNA sequence with the program "phrap" (Phil Green, Univ. of Washington, Seattle, Washington). The
consensus sequence obtained therefrom is herein designated DNA56251.

In light of an observed sequence homology between the DNA56251 consensus sequence and an EST
sequence encompassed within the Merck EST clone no. AA081912, the Merck EST clone AA081912 was
purchased and the cDNA insert was obtained and sequenced. It was found that this insert encoded a full-length
protein. The sequence of this cDNA insert is shown in Figure 136 and is the full-length DNA sequence for
PRO1074. Clone DNA57704-1452 was deposited with the ATCC on June 9, 1998, and is assigned ATCC
deposit no. 209953.

The entire nucleotide sequence of DNA57704-1452 is shown in Figure 136 (SEQ ID NO:208). Clone
DNA57704-1452 contains a single open reading frame with an apparent translational initiation site at nucleotide
positions 322-324 and ending at the stop codon at nucleotide positions 1315-1317 (Figure 136). The predicted
polypeptide precursor is 331 amino acids long (Figure 137). The full-length PRO1074 protein shown in Figure
137 has an estimated molecular weight of about 39,512 daltons and a pI of about 8.03. Analysis of the
full-length PRO1074 sequence shown in Figure 137 (SEQ ID NO:209) evidences the presence of the following
features: a transmembrane domain at about amino acids 20 to 39; potential N-glycosylation sites at about amino
acids 72 to 75, 154 to 157, 198 to 201, 212 to 215, and 326 to 329; a glycosaminoglycan attachment site at about
amino acids 239 to 242; and a Ly-6/u-PAR domain at about amino acids 23 to 36.
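Potential N-glycosylation sites such as those listed above are conventionally reported as four-residue spans. One common way to locate them is the PROSITE-style pattern N-{P}-[ST]-{P}. The sketch below assumes that pattern; the source does not state which tool or pattern was actually used, and the toy sequence is invented for illustration because the figure sequences are not reproduced here.

```python
import re
from typing import List, Tuple

# N-{P}-[ST]-{P}: Asn, any residue except Pro, Ser or Thr, any residue except Pro.
# A lookahead is used so that overlapping candidate sites are all reported.
N_GLYC_PATTERN = re.compile(r"(?=(N[^P][ST][^P]))")


def find_n_glycosylation_sites(protein: str) -> List[Tuple[int, int]]:
    """Return 1-based, inclusive (first, last) spans of candidate sites."""
    return [(m.start() + 1, m.start() + 4) for m in N_GLYC_PATTERN.finditer(protein)]


# Invented toy sequence (NOT a sequence from the source):
# "NGSA" at 3-6 matches, "NPTL" at 8-11 is rejected (Pro), "NLTQ" at 12-15 matches.
toy_sequence = "MKNGSAANPTLNLTQ"
print(find_n_glycosylation_sites(toy_sequence))  # [(3, 6), (12, 15)]
```

Reporting spans as (first, last) keeps the output in the same "amino acids 72 to 75" form used in the text.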

Analysis of the amino acid sequence of the full-length PRO1074 polypeptide suggests that it possesses
significant sequence similarity to beta 1,3-galactosyltransferase, thereby indicating that PRO1074 may be a novel
member of the galactosyltransferase family of proteins. Analysis of the amino acid sequence of the full-length
PRO1074 polypeptide using the Dayhoff database (version 35.45 SwissProt 35) evidenced homology between
the PRO1074 amino acid sequence and the following Dayhoff sequences: AF029792_1, P_R57433,
DMU41449_1, AC000348_14, PJW7479, CET09F5_2, CEF14B6_4, CET15D6_5, CEC54C8_4, and
CEE03H4_10.

Clone DNA57704-1452 was deposited with the ATCC on June 9, 1998, and is assigned ATCC deposit
no. 209953.

EXAMPLE 59: Isolation of cDNA clones Encoding Human PRO1005

Use of the signal sequence algorithm described in Example 3 above allowed identification of an EST
cluster sequence from the LIFESEQ® database, Incyte cluster sequence no. 49243. This EST cluster sequence
was then compared to a variety of expressed sequence tag (EST) databases which included public EST databases
(e.g., GenBank) and a proprietary EST DNA database (LIFESEQ®, Incyte Pharmaceuticals, Palo Alto, CA) to
identify existing homologies. The homology search was performed using the computer program BLAST or
BLAST2 (Altschul et al., Methods in Enzymology 266:460-480 (1996)). Those comparisons resulting in a
BLAST score of 70 (or in some cases 90) or greater that did not encode known proteins were clustered and
assembled into a consensus DNA sequence with the program "phrap" (Phil Green, University of Washington,
Seattle, Washington). The consensus sequence obtained therefrom is herein designated DNA56380.

In light of an observed sequence homology between the DNA56380 consensus sequence and an EST
sequence encompassed within the Merck EST clone no. AA256657, the Merck EST clone AA256657 was
purchased and the cDNA insert was obtained and sequenced. It was found that this insert encoded a full-length
protein. The sequence of this cDNA insert is shown in Figure 138 and is herein designated as DNA57708-1411.

The full length clone shown in Figure 138 contained a single open reading frame with an apparent
translational initiation site at nucleotide positions 30-32 and ending at the stop codon found at nucleotide positions
585-587 (Figure 138; SEQ ID NO:210). The predicted polypeptide precursor (Figure 139, SEQ ID NO:211)
is 185 amino acids long. PRO1005 has a calculated molecular weight of approximately 20,331 daltons and an
estimated pI of approximately 5.85. Clone DNA57708-1411 was deposited with the ATCC on June 23, 1998, and
is assigned ATCC deposit no. 203021.

An analysis of the Dayhoff database (version 35.45 SwissProt 35), using a WU-BLAST2 sequence
alignment analysis of the full-length sequence shown in Figure 139 (SEQ ID NO:211), evidenced some
homology between the PRO1005 amino acid sequence and the following Dayhoff sequences: DDU07187_1,
DDU87912_1, CELD1007_14, A42239, DDU42597_1, CYAGJMCDI, S50452, MRKCKLEPN, P_R41998,
and XYNA_RUMFL.

WO 99/63088
PCT/US99/12252

EXAMPLE 60: Isolation of cDNA clones Encoding Human PRO1073

An initial DNA sequence referred to herein as DNA55938 and shown in Figure 142 (SEQ ID NO:214)
was identified using a yeast screen, in a human SK-Lu-1 adenocarcinoma cell line cDNA library that
preferentially represents the 5' ends of the primary cDNA clones. DNA55938 was then compared to ESTs from
public databases (e.g., GenBank), and a proprietary EST database (LIFESEQ®, Incyte Pharmaceuticals, Palo
Alto, CA), using the computer program BLAST or BLAST2 [Altschul et al., Methods in Enzymology, 266:460-480
(1996)]. The ESTs were clustered and assembled into a consensus DNA sequence using the computer
program "phrap" (Phil Green, University of Washington, Seattle, Washington). The consensus sequence
obtained is designated herein as DNA56411.

In light of an observed sequence homology between the DNA56411 consensus sequence and an EST
sequence encompassed within the Merck EST clone no. H86027, the Merck EST clone H86027 was purchased
and the cDNA insert was obtained and sequenced. It was found that this insert encoded a full-length protein.
The sequence of this cDNA insert is shown in Figure 140.

The full length DNA57710-1451 clone shown in Figure 140 contained a single open reading frame with
an apparent translational initiation site at nucleotide positions 345-347 and ending at the stop codon found at
nucleotide positions 1242-1244 (Figure 140; SEQ ID NO:212). The predicted polypeptide precursor (Figure
141; SEQ ID NO:213) is 299 amino acids long. PRO1073 has a calculated molecular weight of approximately
34,689 daltons and an estimated pI of approximately 11.49. The PRO1073 polypeptide has the following
additional features: a signal peptide at about amino acids 1-31, sequence identity to bZIP transcription factor
basic domain signature at about amino acids, a potential N-glycosylation site at about amino acids 2-5, and
sequence identity with protamine P1 proteins at about amino acids 158-183.

An analysis of the Dayhoff database (version 35.45 SwissProt 35), using a WU-BLAST-2 sequence
alignment analysis of the full-length sequence shown in Figure 141 (SEQ ID NO:213), revealed some sequence
identity between the PRO1073 amino acid sequence and the following Dayhoff sequences: MMU37351_1,
ATAC00250510T1W22.10, S59043, ENXNUPRJ, B47328, SR55_DROME, S26650, SON_HUMAN,
VTT2_CHICK, and XLC4SRPRT_1.

Clone DNA57710-1451 was deposited with the ATCC on July 1, 1998 and is assigned ATCC deposit
no. 203048.

EXAMPLE 61: Isolation of cDNA clones Encoding Human PRO1152

A cDNA clone (DNA57711-1501) encoding a native human PRO1152 polypeptide was identified by
employing a yeast screen, in a human infant brain cDNA library that preferentially represents the 5' ends of the
primary cDNA clones. Specifically, a yeast screen was employed to identify a cDNA designated herein as
DNA55807 (SEQ ID NO:217; see Figure 145).

In light of an observed sequence homology between the DNA55807 sequence and an EST sequence
encompassed within the Merck EST clone no. R56756, the Merck EST clone R56756 was purchased and the
cDNA insert was obtained and sequenced. It was found that this insert encoded a full-length protein. The
sequence of this cDNA insert is shown in Figure 143.

The full-length DNA57711-1501 clone shown in Figure 143 contains a single open reading frame with
an apparent translational initiation site at nucleotide positions 58-60 and ending at the stop codon at nucleotide
positions 1495-1497 (Figure 143). The predicted polypeptide precursor is 479 amino acids long (Figure 144).
The full-length PRO1152 protein shown in Figure 144 has an estimated molecular weight of about 53,602 daltons
and a pI of about 8.82. Analysis of the full-length PRO1152 sequence shown in Figure 144 (SEQ ID NO:216)
evidences the presence of the following: a signal peptide from about amino acid 1 to about amino acid 28,
transmembrane domains from about amino acid 133 to about amino acid 155, from about amino acid 168 to
about amino acid 187, from about amino acid 229 to about amino acid 247, from about amino acid 264 to about
amino acid 285, from about amino acid 309 to about amino acid 330, from about amino acid 371 to about amino
acid 390 and from about amino acid 441 to about amino acid 464, potential N-glycosylation sites from about
amino acid 34 to about amino acid 37 and from about amino acid 387 to about amino acid 390 and an amino acid
sequence block having homology to a respiratory-chain NADH dehydrogenase subunit from about amino acid
243 to about amino acid 287. Clone DNA57711-1501 was deposited with the ATCC on July 1, 1998 and is
assigned ATCC deposit no. 203047.
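The feature coordinates listed above can be held in a small data structure and sanity-checked. The sketch below is illustrative only: the names are hypothetical, the coordinates are the approximate ones given in the text, and the mature-length figure assumes the signal peptide is cleaved immediately after residue 28, which the text does not state.

```python
from typing import List, Tuple

Segment = Tuple[int, int]  # (first residue, last residue), 1-based inclusive

PRECURSOR_LENGTH = 479
SIGNAL_PEPTIDE: Segment = (1, 28)
TRANSMEMBRANE: List[Segment] = [
    (133, 155), (168, 187), (229, 247), (264, 285),
    (309, 330), (371, 390), (441, 464),
]


def segment_length(seg: Segment) -> int:
    """Number of residues in an inclusive segment."""
    return seg[1] - seg[0] + 1


def are_ordered_and_disjoint(segments: List[Segment]) -> bool:
    """True if each segment ends before the next one begins."""
    return all(prev[1] < nxt[0] for prev, nxt in zip(segments, segments[1:]))


def mature_length(precursor_length: int, signal_peptide: Segment) -> int:
    """Precursor length minus the signal peptide (assumed cleavage point)."""
    return precursor_length - segment_length(signal_peptide)


print(len(TRANSMEMBRANE))                               # 7
print([segment_length(s) for s in TRANSMEMBRANE])       # [23, 20, 19, 22, 22, 20, 24]
print(are_ordered_and_disjoint(TRANSMEMBRANE))          # True
print(mature_length(PRECURSOR_LENGTH, SIGNAL_PEPTIDE))  # 451
```

Under these assumptions the listed coordinates describe seven non-overlapping transmembrane segments of 19 to 24 residues each.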

An analysis of the Dayhoff database (version 35.45 SwissProt 35), using a WU-BLAST-2 sequence 
alignment analysis of the full-length sequence shown in Figure 144 (SEQ ID NO:216), evidenced significant 
homology between the PRO1152 amino acid sequence and the following Dayhoff sequences: AF052239_1,
SYNN9CGAJ, SFCYTB2_1, GEN12507, PJU1769, MTV025_109, C61168, S43171, P_P61689 and
P_P61696.

EXAMPLE 62: Isolation of cDNA clones Encoding Human PRO1136

Use of the signal sequence algorithm described in Example 3 above allowed identification of an EST
cluster sequence from the Incyte database, designated 109142. This EST cluster sequence was then compared
to a variety of expressed sequence tag (EST) databases which included public EST databases (e.g., GenBank)
and a proprietary EST DNA database (LIFESEQ®, Incyte Pharmaceuticals, Palo Alto, CA) to identify existing
homologies. The homology search was performed using the computer program BLAST or BLAST2 (Altschul
et al., Methods in Enzymology 266:460-480 (1996)). Those comparisons resulting in a BLAST score of 70 (or
in some cases 90) or greater that did not encode known proteins were clustered and assembled into a consensus
DNA sequence with the program "phrap" (Phil Green, University of Washington, Seattle, Washington). The
consensus sequence obtained therefrom is herein designated DNA56039.

In light of an observed sequence homology between the DNA56039 consensus sequence and an EST
sequence encompassed within the Merck EST clone no. HSC1NF011, the Merck EST clone HSC1NF011 was
purchased and the cDNA insert was obtained and sequenced. It was found that this insert encoded a full-length
protein. The sequence of this cDNA insert is shown in Figure 146 and is herein designated as DNA57827-1493.

Clone DNA57827-1493 contains a single open reading frame with an apparent translational initiation
site at nucleotide positions 216-218 and ending at the stop codon at nucleotide positions 2112-2114 (Figure 146).
The predicted polypeptide precursor is 632 amino acids long (Figure 147). The full-length PRO1136 protein
shown in Figure 147 has an estimated molecular weight of about 69,643 daltons and a pI of about 8.5. Analysis
of the full-length PRO1136 sequence shown in Figure 147 (SEQ ID NO:219) evidences the presence of the
following: a signal peptide from about amino acid 1 to about amino acid 15 and potential N-glycosylation sites
from about amino acid 108 to about amino acid 111, from about amino acid 157 to about amino acid 160, from
about amino acid 289 to about amino acid 292 and from about amino acid 384 to about [...]. Clone
DNA57827-1493 was deposited with the ATCC on July 1, 1998 and is assigned ATCC deposit no. 203045.

An analysis of the Dayhoff database (version 35.45 SwissProt 35), using a WU-BLAST2 sequence
alignment analysis of the full-length sequence shown in Figure 147 (SEQ ID NO:219), evidenced significant
homology between the PRO1136 amino acid sequence and the following Dayhoff sequences: AF034746_1,
AF034745_1, MMAF000168_19, HSMUPP1_1, AF060539_1, SP97_RAT, 138757, MMU93309_1,
CEK01A6_4 and HSA224747_1.

EXAMPLE 63: Isolation of cDNA clones Encoding Human PRO813

Use of the signal sequence algorithm described in Example 3 above allowed identification of a single
Incyte EST cluster sequence (Incyte EST cluster sequence no. 45501). Incyte EST cluster sequence no.
45501 was then compared to a variety of expressed sequence tag (EST) databases which included public
EST databases (e.g., GenBank) and a proprietary EST DNA database (LIFESEQ™, Incyte Pharmaceuticals, Palo
Alto, CA) to identify existing homologies. The homology search was performed using the computer program
BLAST or BLAST2 (Altschul et al., Methods in Enzymology 266:460-480 (1996)). Those comparisons resulting
in a BLAST score of 70 (or in some cases 90) or greater that did not encode known proteins were clustered and
assembled into a consensus DNA sequence with the program "phrap" (Phil Green, University of Washington,
Seattle, Washington). The consensus sequence obtained therefrom is herein designated DNA56400.

In light of an observed sequence homology between the DNA56400 consensus sequence and an EST 
sequence encompassed within the Merck EST clone no. T90592, the Merck EST clone T90592 was purchased
and the cDNA insert was obtained and sequenced. It was found that this insert encoded a full-length protein. 
The sequence of this cDNA insert is shown in Figure 148 and is herein designated DNA57834-1339. 

The full length clone shown in Figure 148 contained a single open reading frame with an apparent
translational initiation site at nucleotide positions 109-111 and ending at the stop codon found at nucleotide
positions 637-639 (Figure 149; SEQ ID NO:221). The predicted polypeptide precursor is 176 amino acids long.
PRO813 has a calculated molecular weight of approximately 19,616 daltons and an estimated pI of approximately 7.11.
Analysis of the full-length PRO813 sequence shown in Figure 149 (SEQ ID NO:221) evidences the presence of
the following: a signal peptide from about amino acid 1 to about amino acid 26 and potential N-myristoylation
sites from about amino acid 48 to about amino acid 53, from about amino acid 153 to about amino acid 158,
from about amino acid 156 to about amino acid 161 and from about amino acid 167 to about amino acid 172.
Clone DNA57834-1339 was deposited with the ATCC on June 9, 1998 and is assigned ATCC deposit no.
209954.

Analysis of the amino acid sequence of the full-length PRO813 polypeptide suggests that it possesses
sequence similarity to the pulmonary surfactant-associated protein C. More specifically, an analysis of the
Dayhoff database (version 35.45 SwissProt 35) evidenced some degree of homology between the PRO813 amino
acid sequence and the following Dayhoff sequences: PSPC_MUSVI, P_P92071, G02964, P_R65489, P_P82977,
P_R84555, S55542, MUSIGHAJJ and PH1158.

EXAMPLE 64: Isolation of cDNA Clones Encoding Human PRO809

Use of the signal sequence algorithm described in Example 3 above allowed identification of a single
Incyte EST cluster sequence. The Incyte EST cluster sequence was then compared to a variety of expressed
sequence tag (EST) databases which included public EST databases (e.g., GenBank) and a proprietary EST DNA
database (LIFESEQ™, Incyte Pharmaceuticals, Palo Alto, CA) to identify existing homologies. The homology
search was performed using the computer program BLAST or BLAST2 (Altschul et al., Methods in Enzymology
266:460-480 (1996)). Those comparisons resulting in a BLAST score of 70 (or in some cases 90) or greater
that did not encode known proteins were clustered and assembled into a consensus DNA sequence with the
program "phrap" (Phil Green, University of Washington, Seattle, Washington). The consensus sequence
obtained therefrom is herein designated DNA56418.

In light of an observed sequence homology between the DNA56418 consensus sequence and an EST 
sequence encompassed within the Merck EST clone no. H74302, the Merck EST clone H74302 was purchased 
and the cDNA insert was obtained and sequenced. It was found that this insert encoded a full-length protein.
The sequence of this cDNA insert is shown in Figure 150 and is herein designated DNA57836-1338.

The entire nucleotide sequence of DNA57836-1338 is shown in Figure 150 (SEQ ID NO:222). Clone
DNA57836-1338 contains a single open reading frame with an apparent translational initiation site at nucleotide
positions 63-65 and ending at the stop codon at nucleotide positions 858-860 of SEQ ID NO:222 (Figure 150).
The predicted polypeptide precursor is 265 amino acids long (Figure 151). The full-length PRO809 protein
shown in Figure 151 has an estimated molecular weight of about 29,061 daltons and a pI of about 9.18. Figure
151 further shows the approximate positions of the signal peptide and N-glycosylation sites. The corresponding
nucleotides can be determined by referencing Figure 150. Clone DNA57836-1338 was deposited with
the ATCC on June 23, 1998. It is understood that the deposited clone has the actual nucleic acid sequence and that
the sequences provided herein are based on known sequencing techniques.

Analysis of the amino acid sequence of the full-length PRO809 polypeptide suggests that it possesses
some sequence similarity to the heparan sulfate proteoglycan and to endothelial cell adhesion molecule-1. More
specifically, an analysis of the Dayhoff database (version 35.45 SwissProt 35) evidenced sequence identity
between the PRO809 amino acid sequence and the following Dayhoff sequences: PGBM_MOUSE, D82082_1
and PW14158.

EXAMPLE 65: Isolation of cDNA Clones Encoding Human PRO791

Use of the signal sequence algorithm described in Example 3 above allowed identification of a single 
Incyte EST cluster sequence. The Incyte EST cluster sequence was then compared to a variety of expressed 
sequence tag (EST) databases which included public EST databases (e.g., GenBank) and a proprietary EST DNA 
database (LIFESEQ™, Incyte Pharmaceuticals, Palo Alto, CA) to identify existing homologies. The homology 
search was performed using the computer program BLAST or BLAST2 (Altshul et al., Methods in Enzymology 
266:460-480 (1996)). Those comparisons resulting in a BLAST score of 70 (or in some cases 90) or greater 
that did not encode known proteins were clustered and assembled into a consensus DNA sequence with the 
program "phrap" (Phil Green, University of Washington, Seattle, Washington). The consensus sequence 
obtained therefrom is herein designated DNA56429. 
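
The selection step described above (a BLAST score of 70 or greater, or 90 in some cases, and no known encoded protein) may be pictured as a simple filter applied before assembly. The following Python sketch is offered only as an illustration under stated assumptions: the record type `BlastHit`, the function `select_for_assembly` and the example identifiers and scores are all hypothetical, and the sketch does not reproduce the BLAST or phrap programs themselves.

```python
from dataclasses import dataclass
from typing import Iterable, List


@dataclass(frozen=True)
class BlastHit:
    """One EST-versus-database comparison (hypothetical record layout)."""
    est_id: str
    score: float
    encodes_known_protein: bool


def select_for_assembly(hits: Iterable[BlastHit], min_score: float = 70.0) -> List[str]:
    """Keep ESTs scoring at or above the cutoff that do not encode a known protein."""
    return [
        h.est_id
        for h in hits
        if h.score >= min_score and not h.encodes_known_protein
    ]


# Invented example records; identifiers and scores are placeholders, not data from the text.
hits = [
    BlastHit("EST_A", 95.0, False),
    BlastHit("EST_B", 72.0, True),   # dropped: encodes a known protein
    BlastHit("EST_C", 64.0, False),  # dropped: below the cutoff of 70
    BlastHit("EST_D", 70.0, False),
]
print(select_for_assembly(hits))                # ['EST_A', 'EST_D']
print(select_for_assembly(hits, min_score=90))  # ['EST_A']
```

The identifiers returned by such a filter would then be passed to the assembly program; that step is outside the scope of this sketch.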

In light of an observed sequence homology between the DNA56429 consensus sequence and an EST 
sequence encompassed within the Merck EST clone no. 36367, the Merck EST clone 36367 was purchased and 
the cDNA insert was obtained and sequenced. It was found that this insert encoded a full-length protein. The 
sequence of this cDNA insert is shown in Figure 152 and is herein designated DNA57838-1337. 

The entire nucleotide sequence of DNA57838-1337 is shown in Figure 152 (SEQ ID NO:224). Clone 
DNA57838-1337 contains a single open reading frame with an apparent translational initiation site at nucleotide 
positions 9-11 and ending at the stop codon at nucleotide positions 747-749 of SEQ ID NO:224 (Figure 152). 
The predicted polypeptide precursor is 246 amino acids long (Figure 153). The full-length PRO791 protein 
shown in Figure 153 has an estimated molecular weight of about 27,368 daltons and a pI of about 7.45. Figure 
153 also shows the approximate locations of the signal peptide, the transmembrane domain, N-glycosylation 
sites and a region conserved in extracellular proteins. The corresponding nucleotides of one embodiment 
provided herein can be identified by referencing Figure 152. Clone DNA57838-1337 has been deposited with 
ATCC on June 23, 1998. It is understood that the deposited clone has the actual nucleic acid sequence and that 
the sequences provided herein are based on known sequencing techniques. 

Analysis of the amino acid sequence of the full-length PRO791 polypeptide suggests that it has sequence 
similarity with MHC-I antigens, thereby indicating that PRO791 may be related to MHC-I antigens. More 
specifically, an analysis of the Dayhoff database (version 35.45 SwissProt 35) evidenced some sequence identity 
between the PRO791 amino acid sequence and the following Dayhoff sequences: AF034346_1, MMQ1K51 and 
HFE_HUMAN. 

EXAMPLE 66: Isolation of cDNA Clones Encoding Human PRO1004 

Use of the signal sequence algorithm described in Example 3 above allowed identification of a single 
Incyte EST cluster sequence, Incyte cluster sequence No. 73681. This EST cluster sequence was then compared 
to a variety of expressed sequence tag (EST) databases which included public EST databases (e.g., GenBank) 
to identify existing homologies. The homology search was performed using the computer program BLAST or 
BLAST2 (Altshul et al., Methods in Enzymology 266:460-480 (1996)). Those comparisons resulting in a 
BLAST score of 70 (or in some cases 90) or greater that did not encode known proteins were clustered and 
assembled into a consensus DNA sequence with the program "phrap" (Phil Green, Univ. of Washington, Seattle, 
Washington). The consensus sequence obtained therefrom is herein designated as DNA56516. 

In light of an observed sequence homology between the DNA56516 consensus sequence and an EST 
sequence encompassed within the Merck EST clone no. H43837, the Merck EST clone H43837 was purchased 
and the cDNA insert was obtained and sequenced. It was found that this insert encoded a full-length protein. 
The sequence of this cDNA insert is shown in Figure 154. 

The full length clone shown in Figure 154 contained a single open reading frame with an apparent 
translational initiation site at nucleotide positions 119-121 and ending at the stop codon at nucleotide positions 
464-466 (Figure 154; SEQ ID NO:226). The predicted polypeptide precursor is 115 amino acids long (Figure 
155; SEQ ID NO:227). The full-length PRO1004 protein shown in Figure 155 has an estimated molecular 
weight of about 13,649 daltons and a pI of about 9.58. Analysis of the full-length PRO1004 sequence shown 
in Figure 155 (SEQ ID NO:227) evidences the presence of the following features: a signal peptide at about amino 
acids 1-24, a microbodies C-terminal targeting signal at about amino acids 113-115, a potential N-glycosylation 
site at about amino acids 71-74, and a domain having sequence identity with dihydrofolate reductase proteins at 
about amino acids 22-48. 

Analysis of the amino acid sequence of the full-length PRO1004 polypeptide using the Dayhoff database 
(version 35.45 SwissProt 35) evidenced homology between the PRO1004 amino acid sequence and the following 
Dayhoff sequences: CELR02D3_7, LECI_MOUSE, AF006691_3, SSZ97390_1, SSZ97395_1, and 
SSZ97400_1. 

Clone DNA57844-1410 was deposited with ATCC on [...] 1998, and is assigned ATCC deposit 
no. 203010. 

EXAMPLE 67: Isolation of cDNA clones Encoding Human PRO1111 

An expressed sequence tag (EST) DNA database (LIFESEQ™, Incyte Pharmaceuticals, Palo Alto, CA) 
was searched and an EST was identified which had homology to insulin-like growth factor binding protein. 

RNA for construction of cDNA libraries was isolated from human fetal brain. The cDNA libraries used 
to isolate the cDNA clones encoding human PRO1111 were constructed by standard methods using commercially 
available reagents such as those from Invitrogen, San Diego, CA. The cDNA was primed with oligo dT 
containing a NotI site, linked with blunt to SalI hemikinased adaptors, cleaved with NotI, sized appropriately 
by gel electrophoresis, and cloned in a defined orientation into a suitable cloning vector (such as pRKB or 
pRKD; pRK5B is a precursor of pRK5D that does not contain the SfiI site; see, Holmes et al., Science, 
253:1278-1280 (1991)) in the unique XhoI and NotI sites. 

The human fetal brain cDNA libraries (prepared as described above), were screened by hybridization 
with a synthetic oligonucleotide probe based upon the Incyte EST sequence described above: 
5'-CACCACCTGGAGGTCCTGCAGTrGGGCAGGAACTCCATCCGGCAGATTG-3' (SEQ ID NO:251). 

An identified cDNA clone was sequenced in entirety. The entire nucleotide sequence of PRO1111 is 
shown in Figure 156 (SEQ ID NO:228). Clone DNA58721-1475 contains a single open reading frame with an 
apparent translational initiation site at nucleotide positions 57-59 and a stop codon at nucleotide positions 
2016-2018 (Figure 156; SEQ ID NO:228). The predicted polypeptide precursor is 653 amino acids long (Figure 157). 
The transmembrane domains are at positions 21-40 (type II) and 528-548. Clone DNA58721-1475 has been 
deposited with ATCC and is assigned ATCC deposit no. 203110. The full-length PRO1111 protein shown in 
Figure 157 has an estimated molecular weight of about 72,717 daltons and a pI of about 6.99. 

An analysis of the Dayhoff database (version 35.45 SwissProt 35), using a WU-BLAST2 sequence 
alignment analysis of the full-length sequence shown in Figure 157 (SEQ ID NO:229), revealed some sequence 
identity between the PRO1111 amino acid sequence and the following Dayhoff sequences: A58532, D86983_1, 
RNPLGPV_1, PGS2_HUMAN, AF038127_1, ALS_MOUSE, GPV_HUMAN, PGS2_BOVIN, ALS_PAPPA 
and 147020. 

EXAMPLE 68: Isolation of cDNA clones Encoding Human PRO1344 

A consensus DNA sequence was assembled relative to other EST sequences using phrap as described 
in Example 1 above. This consensus sequence is herein designated DNA33790. Based on the DNA33790 
consensus sequence, oligonucleotides were synthesized: 1) to identify by PCR a cDNA library that contained 
the sequence of interest, and 2) for use as probes to isolate a clone of the full-length coding sequence for 
PRO1344. 

PCR primers (forward and reverse) were synthesized: 
forward PCR primer 5'-AGGTTCCTGATGGAGACAACCGCG-3' (SEQ ID NO:232) 
reverse PCR primer 5'-TGTCAAGGACGCACTGCCGTCATG-3' (SEQ ID NO:233) 
Additionally, a synthetic oligonucleotide hybridization probe was constructed from the consensus DNA33790 
sequence which had the following nucleotide sequence 
hybridization probe 
5'-TGGCCAGATCATCAAGCGTGTCTGTGGCAACGAGCGGCCAGCTCCTATCC-3' (SEQ ID NO:234) 

In order to screen several libraries for a source of a full-length clone, DNA from the libraries was 
screened by PCR amplification with the PCR primer pair identified above. A positive library was then used to 
isolate clones encoding the PRO1344 gene using the probe oligonucleotide and one of the PCR primers. RNA 
for construction of the cDNA libraries was isolated from human fetal kidney tissue. 
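
For illustration only, basic properties of the PCR primer pair given above (length, G+C fraction, and the forward-strand sequence to which the reverse primer corresponds) can be computed as in the following Python sketch. The helper names `reverse_complement` and `gc_fraction` are hypothetical and are not taken from the original disclosure; the primer sequences are those of SEQ ID NO:232 and SEQ ID NO:233.

```python
# Translation table mapping each base to its Watson-Crick complement.
_COMPLEMENT = str.maketrans("ACGT", "TGCA")


def reverse_complement(seq: str) -> str:
    """Return the reverse complement of a DNA sequence written 5' to 3'."""
    return seq.upper().translate(_COMPLEMENT)[::-1]


def gc_fraction(seq: str) -> float:
    """Return the fraction of G and C bases in the sequence."""
    seq = seq.upper()
    return (seq.count("G") + seq.count("C")) / len(seq)


forward = "AGGTTCCTGATGGAGACAACCGCG"  # SEQ ID NO:232
reverse = "TGTCAAGGACGCACTGCCGTCATG"  # SEQ ID NO:233

for name, primer in (("forward", forward), ("reverse", reverse)):
    print(name, len(primer), round(gc_fraction(primer), 2))

# Forward-strand sequence that the reverse primer is complementary to.
print(reverse_complement(reverse))
```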

DNA sequencing of the clones isolated as described above gave the full-length DNA sequence for 
PRO1344 (designated herein as DNA58723-1588 [Figure 158, SEQ ID NO:230]) and the derived protein 
sequence for PRO1344. 

The entire nucleotide sequence of DNA58723-1588 is shown in Figure 158 (SEQ ID NO:230). Clone 
DNA58723-1588 contains a single open reading frame with an apparent translational initiation site at nucleotide 
positions 26-28 and ending at the stop codon at nucleotide positions 2186-2188 (Figure 158). The predicted 
polypeptide precursor is 720 amino acids long (Figure 159). The full-length PRO1344 protein shown in Figure 
159 has an estimated molecular weight of about 80,199 daltons and a pI of about 7.77. Analysis of the 
full-length PRO1344 sequence shown in Figure 159 (SEQ ID NO:231) evidences the presence of the following: a 
signal peptide from about amino acid 1 to about amino acid 23, an EGF-like domain cysteine protein signature 
sequence from about amino acid 260 to about amino acid 271, potential N-glycosylation sites from about amino 
acid 96 to about amino acid 99, from about amino acid 279 to about amino acid 282, from about amino acid 316 
to about amino acid 319, from about amino acid 451 to about amino acid 454 and from about amino acid 614 
to about amino acid 617, an amino acid sequence block having homology to serine proteases, trypsin family from 
about amino acid 489 to about amino acid 505 and a CUB domain protein profile sequence from about amino 
acid 150 to about amino acid 166. Clone DNA58723-1588 has been deposited with ATCC on August 18, 1998 
and is assigned ATCC deposit no. 203133. 
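
The potential N-glycosylation sites listed above each span four residues (for example, amino acids 96 to 99), which is consistent with the widely used consensus pattern N-{P}-[ST]-{P}. Assuming that pattern was the one applied (the text does not say so), a scan for such sites can be sketched in Python as follows. The function name `n_glycosylation_sites` and the toy sequence are hypothetical; the actual PRO1344 sequence of Figure 159 is not reproduced here.

```python
import re
from typing import List, Tuple

# Consensus N-{P}-[ST]-{P}: Asn, any residue except Pro, Ser or Thr, any residue except Pro.
# The zero-width lookahead allows overlapping sites to be reported.
_SEQUON = re.compile(r"(?=N[^P][ST][^P])")


def n_glycosylation_sites(protein: str) -> List[Tuple[int, int]]:
    """Return 1-based (first, last) residue positions of each four-residue match."""
    return [(m.start() + 1, m.start() + 4) for m in _SEQUON.finditer(protein.upper())]


# Invented toy sequence, used only to exercise the function.
toy = "MKNGSANPTQNNTSA"
print(n_glycosylation_sites(toy))  # [(3, 6), (11, 14), (12, 15)]
```

The lookahead form is used so that adjacent, overlapping candidate sites are each reported rather than only the first.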

An analysis of the Dayhoff database (version 35.45 SwissProt 35), using a WU-BLAST2 sequence 
alignment analysis of the full-length sequence shown in Figure 159 (SEQ ID NO:231), evidenced significant 
homology between the PRO1344 amino acid sequence and the following Dayhoff sequences: S77063_1, 
CRAR_MOUSE, P_R74775, P_P90070, P_R09217, P_P70475, HSBMP16_1 and U50330_1. 

EXAMPLE 69: Isolation of cDNA clones Encoding Human PRO1109 

A consensus DNA sequence was assembled relative to other EST sequences using phrap as described 
in Example 1 above. This consensus sequence is herein designated DNA52642. The consensus DNA sequence 
was obtained by extending a previously obtained consensus sequence as far as possible, using repeated cycles 
of BLAST and phrap and the sources of EST sequences discussed above. Based on the DNA52642 consensus 
sequence, oligonucleotides were synthesized: 1) to identify by PCR a cDNA library that contained the sequence 
of interest, and 2) for use as probes to isolate a clone of the full-length coding sequence for PRO1109. 
PCR primers (forward and reverse) were synthesized: 

forward PCR primer 5'-CCTTACCTCAGAGGCCAGAGCAAGC-3' (SEQ ID NO:237) 
reverse PCR primer 5'-GACKJTTCATCCGTTCTGCGTTCACC-3' (SEQ ID NO:238) 
Additionally, a synthetic oligonucleotide hybridization probe was constructed from the consensus DNA52642 
sequence which had the following nucleotide sequence 
hybridization probe 
5'-CAGGAATGTAAAGC^TTACAGA(KJGTCGCCATCCTCGTTCCCCACC-3' (SEQ ID NO:239) 

In order to screen several libraries for a source of a full-length clone, DNA from the libraries was 
screened by PCR amplification with the PCR primer pair identified above. A positive library was then used to 
isolate clones encoding the PRO1109 gene using the probe oligonucleotide and one of the PCR primers. RNA 
for construction of the cDNA libraries was isolated from human SK-Lu-1 adenocarcinoma cell tissue (LIB247). 

DNA sequencing of the clones isolated as described above gave the full-length DNA sequence for 
PRO1109 (designated herein as DNA58737-1473 [Figure 160, SEQ ID NO:235]) and the derived protein 
sequence for PRO1109. 

The entire nucleotide sequence of DNA58737-1473 is shown in Figure 160 (SEQ ID NO:235). Clone 
DNA58737-1473 contains a single open reading frame with an apparent translational initiation site at nucleotide 
positions 119-121 and ending at the stop codon at nucleotide positions 1151-1153 (Figure 160). The predicted 
polypeptide precursor is 344 amino acids long (Figure 161). The full-length PRO1109 protein shown in Figure 
161 has an estimated molecular weight of about 40,041 daltons and a pI of about 9.34. Analysis of the 
full-length PRO1109 sequence shown in Figure 161 (SEQ ID NO:236) evidences the presence of the following: a 
signal peptide from about amino acid 1 to about amino acid 27, potential N-glycosylation sites from about amino 
acid 4 to about amino acid 7, from about amino acid 220 to about amino acid 223 and from about amino acid 
335 to about amino acid 338 and an amino acid sequence block having homology to xylose isomerase proteins 
from about amino acid 191 to about amino acid 201. Clone DNA58737-1473 has been deposited with ATCC 
on August 18, 1998 and is assigned ATCC deposit no. 203136. 

An analysis of the Dayhoff database (version 35.45 SwissProt 35), using a WU-BLAST2 sequence 
alignment analysis of the full-length sequence shown in Figure 161 (SEQ ID NO:236), evidenced significant 
homology between the PRO1109 amino acid sequence and the following Dayhoff sequences: HSUDPGAL_1, 
HSUDPB14_1, NALS_BOVIN, HSU10473_1, CEW02B12_11, YNJ4_CAEEL, AE000738_11, CET24D1_1, 
S48121 and CEGLY9_1. 

EXAMPLE 70: Isolation of cDNA clones Encoding Human PRO1383 

A consensus DNA sequence was assembled relative to other EST sequences using phrap as described 
in Example 1 above. This consensus sequence is herein designated DNA53961. Based on the DNA53961 
consensus sequence, oligonucleotides were synthesized: 1) to identify by PCR a cDNA library that contained 
the sequence of interest, and 2) for use as probes to isolate a clone of the full-length coding sequence for 
PRO1383. 

PCR primers (forward and reverse) were synthesized: 
forward PCR primer 5'-CATTTCCTTACCCTGGACCCAGCTCC-3' (SEQ ID NO:242) 
reverse PCR primer 5'-GAAAGGCCCACAGCACATCTGGCAG-3' (SEQ ID NO:243) 
Additionally, a synthetic oligonucleotide hybridization probe was constructed from the consensus DNA53961 
sequence which had the following nucleotide sequence 
hybridization probe 
5'-CCACGAC(XGAGCAACTTCCTC^AGACCGACnTGTTTCTCTACAGC-3' (SEQ ID NO:244) 

In order to screen several libraries for a source of a full-length clone, DNA from the libraries was 
screened by PCR amplification with the PCR primer pair identified above. A positive library was then used to 
isolate clones encoding the PRO1383 gene using the probe oligonucleotide and one of the PCR primers. RNA 
for construction of the cDNA libraries was isolated from human fetal brain tissue. 

DNA sequencing of the clones isolated as described above gave the full-length DNA sequence for 
PRO1383 (designated herein as DNA58743-1609 [Figure 162, SEQ ID NO:240]) and the derived protein 
sequence for PRO1383. 

The entire nucleotide sequence of DNA58743-1609 is shown in Figure 162 (SEQ ID NO:240). Clone 
DNA58743-1609 contains a single open reading frame with an apparent translational initiation site at nucleotide 
positions 122-124 and ending at the stop codon at nucleotide positions 1391-1393 (Figure 162). The predicted 
polypeptide precursor is 423 amino acids long (Figure 163). The full-length PRO1383 protein shown in Figure 
163 has an estimated molecular weight of about 46,989 daltons and a pI of about 6.77. Analysis of the 
full-length PRO1383 sequence shown in Figure 163 (SEQ ID NO:241) evidences the presence of the following: a 
signal peptide from about amino acid 1 to about amino acid 24, a transmembrane domain from about amino acid 
339 to about amino acid 362, and potential N-glycosylation sites from about amino acid 34 to about amino acid 
37, from about amino acid 58 to about amino acid 61, from about amino acid 142 to about amino acid 145, from 
about amino acid 197 to about amino acid 200, from about amino acid 300 to about amino acid 303 and from 
about amino acid 364 to about amino acid 367. Clone DNA58743-1609 has been deposited with ATCC on 
August 25, 1998 and is assigned ATCC deposit no. 203154. 

An analysis of the Dayhoff database (version 35.45 SwissProt 35), using a WU-BLAST2 sequence 
alignment analysis of the full-length sequence shown in Figure 163 (SEQ ID NO:241), evidenced significant 
homology between the PRO1383 amino acid sequence and the following Dayhoff sequences: NMB_HUMAN, 
QNR_COTJA, P_W38335, PI 15_CHICK, P_W38164, A45993_1, MMU70209_1, D83704_1 and P_W39176. 

EXAMPLE 71: Isolation of cDNA Clones Encoding Human PRO1003 

Use of the signal sequence algorithm described in Example 3 above allowed identification of a single 
Incyte EST cluster sequence designated herein as 43055. This sequence was then compared to a variety of EST 
databases which included public EST databases (e.g., GenBank) and a proprietary EST DNA database 
(LIFESEQ™, Incyte Pharmaceuticals, Palo Alto, CA) to identify existing homologies. The homology search 
was performed using the computer program BLAST or BLAST2 (Altshul et al., Methods in Enzymology 
266:460-480 (1996)). Those comparisons resulting in a BLAST score of 70 (or in some cases 90) or greater 
that did not encode known proteins were clustered and assembled into a consensus DNA sequence with the 
program "phrap" (Phil Green, University of Washington, Seattle, Washington). The consensus sequence 
obtained therefrom is herein designated consen01. 

In light of an observed sequence homology between the consensus sequence and an EST sequence 
encompassed within the Incyte EST clone no. 2849382, the Incyte EST clone 2849382 was purchased and the 
cDNA insert was obtained and sequenced. It was found that this insert encoded a full-length protein. The 
sequence of this cDNA insert is shown in Figure 164. 

The entire nucleotide sequence of DNA58846-1409 is shown in Figure 164 (SEQ ID NO:245). Clone 
DNA58846-1409 contains a single open reading frame with an apparent translational initiation site at nucleotide 
positions 41-43 and ending at the stop codon at nucleotide positions [...] (Figure 164). The predicted 
polypeptide precursor is 84 amino acids long (Figure 165). The full-length PRO1003 protein shown in Figure 
165 has an estimated molecular weight of about 9,408 daltons and a pI of about 9.28. Analysis of the full-length 
PRO1003 sequence shown in Figure 165 (SEQ ID NO:246) evidences the presence of a signal peptide at amino 
acids 1 to about 24, and a cAMP- and cGMP-dependent protein kinase phosphorylation site at about amino acids 
58 to about 61. Analysis of the amino acid sequence of the full-length PRO1003 polypeptide using the Dayhoff 
database (version 35.45 SwissProt 35) evidenced homology between the PRO1003 amino acid sequence and the 
following Dayhoff sequences: AOPCZA363J, SRTX_ATREN, A48298, MHVJHMS_1, VGL2_CVMJH, 
DHDHTC22, CORT_RAT, TAL6_HUMAN, P_W14123, and DVUFI 2. 

EXAMPLE 72: Isolation of cDNA Clones Encoding Human PRO1108 

A consensus DNA sequence was assembled relative to other EST sequences using phrap as described 
in Example 1 above. This consensus sequence is herein designated DNA53237. 

In light of an observed sequence homology between the DNA53237 consensus sequence and an EST 
sequence encompassed within the Incyte EST clone no. 2379881, the Incyte EST clone 2379881 was purchased 
and the cDNA insert was obtained and sequenced. It was found that this insert encoded a full-length protein. 
The sequence of this cDNA insert is shown in Figure 166 and is herein designated DNA58848-1472. 

The entire nucleotide sequence of DNA58848-1472 is shown in Figure 166 (SEQ ID NO:247). Clone 
DNA58848-1472 contains a single open reading frame with an apparent translational initiation site at nucleotide 
positions 77-79 and ending at the stop codon at nucleotide positions 1445-1447 (Figure 166). The predicted 
polypeptide precursor is 456 amino acids long (Figure 167). The full-length PRO1108 protein shown in Figure 
167 has an estimated molecular weight of about 52,071 daltons and a pI of about 9.46. Analysis of the 
full-length PRO1108 sequence shown in Figure 167 (SEQ ID NO:248) evidences the presence of the following: type 
II transmembrane domains from about amino acid 22 to about amino acid 42, from about amino acid 156 to 
about amino acid 176, from about amino acid 180 to about amino acid 199 and from about amino acid 369 to 
about amino acid 388, potential N-glycosylation sites from [...] 
about amino acid 327 to about amino acid 330, from about amino acid 328 to about amino acid 331 and from 
about amino acid 362 to about amino acid 365 and an amino acid block having homology to ER lumen protein 
retaining receptor protein from about amino acid 153 to about amino acid 190. Clone DNA58848-1472 has been 
deposited with ATCC on June 9, 1998 and is assigned ATCC deposit no. 209955. 

Analysis of the amino acid sequence of the full-length PRO1108 polypeptide suggests that it possesses 
significant sequence similarity to the LPAAT protein, thereby indicating that PRO1108 may be a novel LPAAT 
homolog. More specifically, an analysis of the Dayhoff database (version 35.45 SwissProt 35) evidenced 
significant homology between the PRO1108 amino acid sequence and the following Dayhoff sequences: 
AF015811_1, CER07E3_2, YL35_CAEEL, S73863, CEF59F4_4, P_W06422, MMU4t736_1, MTV008_39, 
P_R99248 and Y67_BPT7. 

EXAMPLE 73: Isolation of cDNA Clones Encoding Human PRO1137 

The extracellular domain (ECD) sequences (including the secretion signal, if any) of from about 950 
known secreted proteins from the Swiss-Prot public protein database were used to search expressed sequence 
tag (EST) databases. The EST databases included public EST databases (e.g., GenBank) and a proprietary EST 
DNA database (LIFESEQ™, Incyte Pharmaceuticals, Palo Alto, CA). The search was performed using the 
computer program BLAST or BLAST2 (Altshul et al., Methods in Enzymology 266:460-480 (1996)) as a 
comparison of the ECD protein sequences to a 6 frame translation of the EST sequence. Using this procedure, 
Incyte EST No. 3459449, also referred to herein as "DNA7108", was identified as an EST having a BLAST 
score of 70 or greater that did not encode a known protein. 
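
The 6 frame translation referred to above means translating a nucleotide sequence in each of the three reading frames of the given strand and the three reading frames of its reverse complement. The following Python sketch illustrates the idea using the standard genetic code; it is not the BLAST implementation, the function names `translate` and `six_frame_translation` are hypothetical, and the short example sequence is invented.

```python
from itertools import product
from typing import Dict

# Standard genetic code, codons enumerated with bases in the order T, C, A, G.
_BASES = "TCAG"
_AMINO = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE: Dict[str, str] = {
    "".join(codon): aa for codon, aa in zip(product(_BASES, repeat=3), _AMINO)
}
_COMPLEMENT = str.maketrans("ACGT", "TGCA")


def translate(seq: str) -> str:
    """Translate complete codons from the start of seq; unknown codons become 'X'."""
    return "".join(
        CODON_TABLE.get(seq[i:i + 3], "X") for i in range(0, len(seq) - 2, 3)
    )


def six_frame_translation(dna: str) -> Dict[str, str]:
    """Return translations for frames +1, +2, +3 and -1, -2, -3."""
    fwd = dna.upper()
    rev = fwd.translate(_COMPLEMENT)[::-1]
    frames = {}
    for offset in range(3):
        frames[f"+{offset + 1}"] = translate(fwd[offset:])
        frames[f"-{offset + 1}"] = translate(rev[offset:])
    return frames


# Invented example: ATG GCC TAA reads as Met-Ala-stop in frame +1.
for frame, peptide in six_frame_translation("ATGGCCTAA").items():
    print(frame, peptide)
```

Each translated frame could then be compared against the protein query sequences, which is the role the translated search plays in the procedure described above.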

A consensus DNA sequence was assembled relative to the DNA7108 sequence and other ESTs using 
repeated cycles of BLAST and the program "phrap" (Phil Green, Univ. of Washington, Seattle, WA). The 
consensus sequence obtained therefrom is referred to herein as DNA53952. 

In light of an observed sequence homology between the DNA53952 consensus sequence and an EST 
sequence encompassed within the Incyte EST clone no. 3663102, the Incyte EST clone 3663102 was purchased 
and the cDNA insert was obtained and sequenced. It was found that this insert encoded a full-length protein. 
The sequence of this cDNA insert is shown in Figure 168. 

The entire nucleotide sequence of DNA58849-1494 is shown in Figure 168 (SEQ ID NO:249). Clone 
DNA58849-1494 contains a single open reading frame with an apparent translational initiation site at nucleotide 
positions 77-79 and ending at the stop codon at nucleotide positions 797-799 (Figure 168). The predicted 
polypeptide precursor is 240 amino acids long (Figure 169). The full-length PRO1137 protein shown in Figure 
169 has an estimated molecular weight of about 26,064 daltons and a pI of about 8.65. Analysis of the 
full-length PRO1137 sequence shown in Figure 169 (SEQ ID NO:250) evidences the presence of a signal peptide 
at about amino acids 1 to 14 and a potential N-glycosylation site at about amino acids 101-105. 

Analysis of the amino acid sequence of the full-length PRO1137 polypeptide suggests that it possesses 
significant sequence similarity to ribosyltransferase thereby indicating that PRO1137 may be a novel member 
of the ribosyltransferase family of proteins. Analysis of the amino acid sequence of the full-length PRO1137 
polypeptide using the Dayhoff database (version 35.45 SwissProt 35) evidenced homology between the PRO1137 
amino acid sequence and the following Dayhoff sequences: MMART5_1, NARG_MOUSE, GEN11909, 
GEN13794, GEN14406, MMRNART62_1, and P_R41876. 

EXAMPLE 74: Isolation of cDNA clones Encoding Human PRO1138 

Use of the signal sequence algorithm described in Example 3 above allowed identification of a single 
Incyte EST sequence, Incyte cluster sequence no. 165212. This cluster sequence was then compared to a variety 
of expressed sequence tag (EST) databases which included public EST databases (e.g., GenBank) and a 
proprietary EST DNA database (LIFESEQ™, Incyte Pharmaceuticals, Palo Alto, CA) to identify existing 
homologies. The homology search was performed using the computer program BLAST or BLAST2 (Altshul 
et al., Methods in Enzymology 266:460-480 (1996)). Those comparisons resulting in a BLAST score of 70 (or 
in some cases 90) or greater that did not encode known proteins were clustered and assembled into a consensus 
DNA sequence with the program "phrap" (Phil Green, University of Washington, Seattle, Washington). The 
consensus sequence obtained therefrom is herein designated as DNA54224. The assembly included a proprietary 
Genentech EST designated herein as DNA49140 (Figure 172; SEQ ID NO:254). 

In light of an observed sequence homology between the DNA54224 consensus sequence and an EST 
sequence encompassed within the Incyte EST clone no. 3836613, the Incyte EST clone 3836613 was purchased 
and the cDNA insert was obtained and sequenced. It was found that this insert encoded a full-length protein. 
The sequence of this cDNA insert is shown in Figure 170 and is the full-length DNA sequence for PRO1138. 
Clone DNA58850-1495 was deposited with the ATCC on June 9, 1998, and is assigned ATCC deposit no. 
209956. 

The entire nucleotide sequence of DNA58850-1495 is shown in Figure 170 (SEQ ID NO:252). Clone 
DNA58850-1495 contains a single open reading frame with an apparent translational initiation site at nucleotide 
positions 38-40 and ending at the stop codon at nucleotide positions 1043-1045 (Figure 170). The predicted 
polypeptide precursor is 335 amino acids long (Figure 171). The full-length PRO1138 protein shown in Figure 
171 has an estimated molecular weight of about 37,421 daltons and a pI of about 6.36. Analysis of the 
full-length PRO1138 sequence shown in Figure 171 (SEQ ID NO:253) evidences the presence of the following 
features: a signal peptide at about amino acid 1 to about amino acid 22; a transmembrane domain at about amino 
acids 224 to about 250; a leucine zipper pattern at about amino acids 229 to about 250; and potential 
N-glycosylation sites at about amino acids 98-101, 142-145, 148-151, 172-175, 176-179, 204-207, and 291-295. 
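
The leucine zipper pattern reported above spans 22 residues (amino acids 229 to 250), which is consistent with the commonly used consensus L-x(6)-L-x(6)-L-x(6)-L. Assuming that consensus is the one intended (the text does not state it), such a pattern can be located as in the following Python sketch. The function name `leucine_zipper_spans` and the toy sequence are hypothetical; the PRO1138 sequence of Figure 171 is not reproduced here.

```python
import re
from typing import List, Tuple

# L-x(6)-L-x(6)-L-x(6)-L: four leucines, each separated by six arbitrary residues.
# The lookahead lets overlapping occurrences be reported.
_LEUCINE_ZIPPER = re.compile(r"(?=L.{6}L.{6}L.{6}L)")


def leucine_zipper_spans(protein: str) -> List[Tuple[int, int]]:
    """Return 1-based (first, last) residue positions of each 22-residue match."""
    return [
        (m.start() + 1, m.start() + 22)
        for m in _LEUCINE_ZIPPER.finditer(protein.upper())
    ]


# Invented toy sequence: leucines at positions 3, 10, 17 and 24.
toy = "GS" + "LAAAAAA" * 3 + "L" + "GS"
print(leucine_zipper_spans(toy))  # [(3, 24)]
```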

Analysis of the amino acid sequence of the full-length PRO1138 polypeptide suggests that it possesses 
significant sequence similarity to CD84, thereby indicating that PRO1138 may be a novel member of the Ig 
superfamily of polypeptides. More particularly, analysis of the amino acid sequence of the full-length PRO1138 
polypeptide using the Dayhoff database (version 35.45 SwissProt 35) evidenced homology between the PRO1138 
amino acid sequence and the following Dayhoff sequences: HSU82988_1, HUMLY9_1, P_R97631, P_IWfi28, 
P_R97629, P_R97630, CD48_RAT, CD2_HUMAN, P_P93996, and HUMBGP_1. 

Clone DNA58850-1495 was deposited with ATCC on June 9, 1998, and is assigned ATCC deposit no. 
209956. 

EXAMPLE 75: Isolation of cDNA clones Encoding Human PRO1054 

Use of the signal sequence algorithm described in Example 3 above allowed identification of an EST 
cluster sequence from the Incyte database, designated 66212. This EST cluster sequence was then compared 
to a variety of expressed sequence tag (EST) databases which included public EST databases (e.g., GenBank) 
and a proprietary EST DNA database (LIFESEQ™, Incyte Pharmaceuticals, Palo Alto, CA) to identify existing 
homologies. The homology search was performed using the computer program BLAST or BLAST2 (Altshul 
et al., Methods in Enzymology 266:460-480 (1996)). Those comparisons resulting in a BLAST score of 70 (or 
in some cases 90) or greater that did not encode known proteins were clustered and assembled into a consensus 
DNA sequence with the program "phrap" (Phil Green, University of Washington, Seattle, Washington). The 
consensus sequence obtained therefrom is herein designated DNA55722. 

In light of an observed sequence homology between the DNA55722 consensus sequence and an EST 
sequence encompassed within the Incyte EST clone no. 319751, the Incyte EST clone 319751 was purchased 
and the cDNA insert was obtained and sequenced. It was found that this insert encoded a full-length protein. 
The sequence of this cDNA insert is shown in Figure 173 and is herein designated as DNA58853-1423. 

Clone DNA58853-1423 contains a single open reading frame with an apparent translational initiation 
site at nucleotide positions 46-48 and ending at the stop codon at nucleotide positions 586-588 (Figure 173). The 
predicted polypeptide precursor is [...] amino acids long (Figure 174). The full-length PRO1054 protein shown 
in Figure 174 has an estimated molecular weight of about 20,638 daltons and a pI of about 5.0. Analysis of the 
full-length PRO1054 sequence shown in Figure 174 (SEQ ID NO:256) evidences the presence of the following: 
a signal peptide from about amino acid 1 to about amino acid 18, a leucine zipper pattern from about amino acid 
155 to about amino acid 176 and amino acid sequence blocks having homology to lipocalin proteins from about 
amino acid 27 to about amino acid 38 and from about amino acid 110 to about amino acid 120. Clone 
DNA58853-1423 has been deposited with ATCC on June 23, 1998 and is assigned ATCC deposit no. 203016. 

An analysis of the Dayhoff database (version 35.45 SwissProt 35), using a WU-BLAST2 sequence 
alignment analysis of the full-length sequence shown in Figure 174 (SEQ ID NO:256), evidenced significant 
homology between the PRO1054 amino acid sequence and the following Dayhoff sequences: MUP1_MOUSE, 
MUP6_MOUSE, MUP2_MOUSE, MUP8_MOUSE, MUP5_MOUSE, MUP4_MOUSE, S10124, 
MUPM_MOUSE, MUP_RAT and ECU70823_1. 

EXAMPLE 76: Isolation of cDNA clones Encoding Human PRO994 

Use of the signal sequence algorithm described in Example 3 above allowed identification of an EST 
cluster sequence from the Incyte database, designated 157555. This EST cluster sequence was then compared 
to a variety of expressed sequence tag (EST) databases which included public EST databases (e.g., GenBank) 
and a proprietary EST DNA database (LIFESEQ®, Incyte Pharmaceuticals, Palo Alto, CA) to identify existing 
homologies. The homology search was performed using the computer program BLAST or BLAST2 (Altshul 
et al., Methods in Enzymology 266:460-480 (1996)). Those comparisons resulting in a BLAST score of 70 (or 
in some cases 90) or greater that did not encode known proteins were clustered and assembled into a consensus 
DNA sequence with the program "phrap" (Phil Green, University of Washington, Seattle, Washington). The 
consensus sequence obtained therefrom is herein designated DNA55728. 
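
The selection rule described above (a BLAST score of 70, or in some cases 90, or greater, for comparisons that did not encode known proteins) can be illustrated with a short sketch. This is not the procedure actually used; the `Hit` record, its field names and the sample identifiers are hypothetical and serve only to show the filtering step that precedes clustering and assembly.

```python
from dataclasses import dataclass


@dataclass
class Hit:
    est_id: str          # hypothetical identifier of the matching EST
    score: float         # BLAST score reported for the comparison
    known_protein: bool  # True if the hit encodes an already known protein


def select_for_assembly(hits, min_score=70.0):
    """Keep hits at or above the cutoff that do not encode known proteins."""
    return [h for h in hits if h.score >= min_score and not h.known_protein]


# Illustrative data only; none of these identifiers come from the source text.
hits = [
    Hit("est_a", 95.0, False),
    Hit("est_b", 72.0, True),
    Hit("est_c", 69.9, False),
    Hit("est_d", 70.0, False),
]

selected = select_for_assembly(hits)                 # default cutoff of 70
strict = select_for_assembly(hits, min_score=90.0)   # stricter cutoff of 90
```

The retained hits would then be passed to an assembler such as phrap; that step is outside the scope of this sketch.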

In light of an observed sequence homology between the DNA55728 consensus sequence and an EST 
sequence encompassed within the Incyte EST clone no. 2860366, the Incyte EST clone 2860366 was purchased 
and the cDNA insert was obtained and sequenced. It was found that this insert encoded a full-length protein. 
The sequence of this cDNA insert is shown in Figure 175 and is herein designated as DNA58855-1422. 

Clone DNA58855-1422 contains a single open reading frame with an apparent translational initiation 
site at nucleotide positions 31-33 and ending at the stop codon at nucleotide positions 718-720 (Figure 175). The 
predicted polypeptide precursor is 229 amino acids long (Figure 176). The full-length PRO994 protein shown 
in Figure 176 has an estimated molecular weight of about 25,109 daltons and a pI of about 6.83. Analysis of 
the full-length PRO994 sequence shown in Figure 176 (SEQ ID NO:258) evidences the presence of the 
following: transmembrane domains from about amino acid 10 to about amino acid 31, from about amino acid 
50 to about amino acid 72, from about amino acid 87 to about amino acid 110 and from about amino acid [illegible] 
to about amino acid 213, potential N-glycosylation sites from about amino acid 80 to about amino acid 83, from 
about amino acid 132 to about amino acid 135, from about amino acid 148 to about amino acid 151 and from 
about amino acid 163 to about amino acid 166 and an amino acid block having homology to TNFR/NGFR 
cysteine-rich region proteins from about amino acid 4 to about amino acid 11. Clone DNA58855-1422 has been 
deposited with ATCC on June 23, 1998 and is assigned ATCC deposit no. 203018. 

An analysis of the Dayhoff database (version 35.45 SwissProt 35), using a WU-BLAST2 sequence 
alignment analysis of the full-length sequence shown in Figure 176 (SEQ ID NO:258), evidenced significant 
homology between the PRO994 amino acid sequence and the following Dayhoff sequences: AF027204_1, 
TAL6_HUMAN, ILT4_HUMAN, JC6205, MMU57570J, S40363, ETU56093J, S42858, P_R66849 and 
P_R74751. 

EXAMPLE 77: Isolation of cDNA clones Encoding Human PRO812 

Use of the signal sequence algorithm described in Example 3 above allowed identification of an EST 
cluster sequence from the Incyte database, designated 170079. This EST cluster sequence was then compared 
to a variety of expressed sequence tag (EST) databases which included public EST databases (e.g., GenBank) 
and a proprietary EST DNA database (LIFESEQ®, Incyte Pharmaceuticals, Palo Alto, CA) to identify existing 
homologies. The homology search was performed using the computer program BLAST or BLAST2 (Altshul 
et al., Methods in Enzymology 266:460-480 (1996)). Those comparisons resulting in a BLAST score of 70 (or 
in some cases 90) or greater that did not encode known proteins were clustered and assembled into a consensus 
DNA sequence with the program "phrap" (Phil Green, University of Washington, Seattle, Washington). The 
consensus sequence obtained therefrom is herein designated as DNA55721. 

In light of an observed sequence homology between the DNA55721 consensus sequence and an EST 
sequence encompassed within the Incyte EST clone no. 388964, the Incyte EST clone 388964 was purchased 
and the cDNA insert was obtained and sequenced. It was found that this insert encoded a full-length protein. 
The sequence of this cDNA insert is shown in Figure 177 and is herein designated as DNA59205-1421. 
Clone DNA59205-1421 contains a single open reading frame with an apparent translational initiation 
site at nucleotide positions 55-57 and ending at the stop codon at nucleotide positions 304-306 (Figure 177). The 
predicted polypeptide precursor is 83 amino acids long (Figure 178). The full-length PRO812 protein shown 
in Figure 178 has an estimated molecular weight of about 9,201 daltons and a pI of about 9.3. Analysis of the 
full-length PRO812 sequence shown in Figure 178 (SEQ ID NO:260) evidences the presence of the following: 
a signal peptide from about amino acid 1 to about amino acid 15, a cAMP- and cGMP-dependent protein kinase 
phosphorylation site from about amino acid 73 to about amino acid 76 and protein kinase C phosphorylation sites 
from about amino acid 70 to about amino acid [illegible]. Clone 
DNA59205-1421 has been deposited with ATCC on June 23, 1998 and is assigned ATCC deposit no. 203009. 
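
The reported open reading frame coordinates and the predicted precursor length are related by simple arithmetic, which gives a quick consistency check. The following is a minimal sketch, not part of the original disclosure. The function name `orf_length_aa` is hypothetical, and the sketch assumes that positions are 1-based and that the stop codon is not translated.

```python
def orf_length_aa(start_first_nt, stop_first_nt):
    """Return the number of translated codons in an open reading frame.

    start_first_nt: 1-based position of the first nucleotide of the start codon.
    stop_first_nt:  1-based position of the first nucleotide of the stop codon.
    """
    coding_nt = stop_first_nt - start_first_nt
    if coding_nt <= 0 or coding_nt % 3 != 0:
        raise ValueError("start and stop codons are not in the same reading frame")
    return coding_nt // 3


# Coordinates reported for DNA59205-1421: start codon 55-57, stop codon 304-306.
print(orf_length_aa(55, 304))  # 83
```

For the coordinates given for DNA59205-1421 the result agrees with the stated precursor length of 83 amino acids.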

An analysis of the Dayhoff database (version 35.45 SwissProt 35), using a WU-BLAST2 sequence 
alignment analysis of the full-length sequence shown in Figure 178 (SEQ ID NO:260), evidenced significant 
homology between the PRO812 amino acid sequence and the following Dayhoff sequences: P_W35802, 
P_W35803, PSC1_RAT, S68231, GEN13917, PSC2_RAT, CC10_HUMAN, UTER_RABIT, AF008595J and 
A56413. 

EXAMPLE 78: Isolation of cDNA clones Encoding Human PRO1069 

Use of the signal sequence algorithm described in Example 3 above allowed identification of a single 
Incyte EST sequence designated herein as 100727. This sequence was then compared to a proprietary EST 
DNA database (LIFESEQ™, Incyte Pharmaceuticals, Palo Alto, CA) to identify existing homologies. The 
homology search was performed using the computer program BLAST or BLAST2 (Altshul et al., Methods in 
Enzymology 266:460-480 (1996)). Those comparisons resulting in a BLAST score of 70 (or in some cases 90) 
or greater that did not encode known proteins were clustered and assembled into a consensus DNA sequence with 
the program "phrap" (Phil Green, Univ. of Washington, Seattle, Washington). The consensus sequence obtained 
therefrom is herein designated DNA56001. 

In light of an observed sequence homology between the DNA56001 consensus sequence and an EST 
sequence encompassed within the Incyte EST clone no. 3533881, the Incyte EST clone 3533881 was purchased 
and the cDNA insert was obtained and sequenced. It was found that this insert encoded a full-length protein. 
The sequence of this cDNA insert is shown in Figure 179 and is the full-length DNA sequence for PRO1069. 
Clone DNA59211-1450 was deposited with the ATCC on June 9, 1998, and is assigned ATCC deposit no. 
209960. 

The entire nucleotide sequence of DNA59211-1450 is shown in Figure 179 (SEQ ID NO:261). Clone 
DNA59211-1450 contains a single open reading frame with an apparent translational initiation site at nucleotide 
positions 197-199 and ending at the stop codon at nucleotide positions 464-466. The predicted polypeptide 
precursor is 89 amino acids long (Figure 180). The full-length PRO1069 protein shown in Figure 180 has an 
estimated molecular weight of about 9,433 daltons and a pI of about 8.21. Analysis of the full-length PRO1069 
sequence shown in Figure 180 (SEQ ID NO:262) evidences the presence of the following features: a signal 
peptide sequence at amino acid 1 to about 16; a transmembrane domain at about amino acids 36 to about 59; 
potential N-myristoylation sites at about amino acids 41-46, 45-50, and 84-89; and homology with extracellular 
proteins SCP/Tpx-1/Ag5/PR-1/Sc7 at about amino acids 54 to about 66. 

Analysis of the amino acid sequence of the full-length PRO1069 polypeptide suggests that it possesses 
significant sequence similarity to CHIF, thereby indicating that PRO1069 may be a member of the CHIF family 
of polypeptides. More particularly, analysis of the amino acid sequence of the full-length PRO1069 polypeptide 
using the Dayhoff database (version 35.45 SwissProt 35) evidenced homology between the PRO1069 amino acid 
sequence and the following Dayhoff sequences: CHIF_RAT, A55571, PLM_HUMAN, A40533, 
ATNG_BOVIN, RIC_MOUSE, PETD_SYNY3, VTB1_XENLA, A05009, and S75086. 

Clone DNA59211-1450 was deposited with the ATCC on June 9, 1998, and is assigned ATCC deposit 
no. 209960. 

EXAMPLE 79: Isolation of cDNA Clones Encoding Human PRO1129 

Use of the signal sequence algorithm described in Example 3 above allowed identification of a single 
Incyte EST cluster sequence designated herein as [illegible]. The Incyte EST cluster sequence 
was then compared to a variety of expressed sequence tag (EST) databases which included public EST databases 
(e.g., GenBank) and a proprietary EST DNA database (LIFESEQ™, Incyte Pharmaceuticals, Palo Alto, CA) 
to identify existing homologies. The homology search was performed using the computer program BLAST or 
BLAST2 (Altshul et al., Methods in Enzymology 266:460-480 (1996)). Those comparisons resulting in a 
BLAST score of 70 (or in some cases 90) or greater that did not encode known proteins were clustered and 
assembled into a consensus DNA sequence with the program "phrap" (Phil Green, University of Washington, 
Seattle, Washington). The consensus sequence obtained therefrom is herein designated DNA56038. 

In light of an observed sequence homology between the DNA56038 consensus sequence and an EST 
sequence encompassed within the Incyte EST clone no. 1335241, the Incyte EST clone 1335241 was purchased 
and the cDNA insert was obtained and sequenced. It was found that this insert encoded a full-length protein. 
The sequence of this cDNA insert is shown in Figure 181 and is herein designated DNA59213-1487. 

The full length clone shown in Figure 181 contained a single open reading frame with an apparent 
translational initiation site at nucleotide positions 42-44 and ending at the stop codon found at nucleotide positions 
1614-1616 (Figure 181; SEQ ID NO:263). The predicted polypeptide precursor is 524 amino acids long, has 
a calculated molecular weight of approximately 60,310 daltons and an estimated pI of approximately 7.46. 
Analysis of the full-length PRO1129 sequence shown in Figure 182 (SEQ ID NO:264) evidences the presence 
of the following: type II transmembrane domains from about amino acid 13 to about amino acid 32 and from 
about amino acid 77 to about amino acid 102, a cytochrome P-450 cysteine heme-iron ligand signature sequence 
from about amino acid 461 to about amino acid 470 and potential N-glycosylation sites from about amino acid 
112 to about amino acid 115 and from about amino acid 168 to about amino acid 171. Clone DNA59213-1487 
has been deposited with the ATCC on June 9, 1998 and is assigned ATCC deposit no. 209959. 

Analysis of the amino acid sequence of the full-length PRO1129 polypeptide suggests that it possesses 
sequence similarity to the cytochrome P-450 family of proteins. More specifically, an analysis of the Dayhoff 
database (version 35.45 SwissProt 35) evidenced some degree of homology between the PRO1129 amino acid 
sequence and the following Dayhoff sequences: AC004523_1, S45702, AF054821_1 and 153015. 

EXAMPLE 80: Isolation of cDNA clones Encoding Human PRO1068 

Use of the signal sequence algorithm described in Example 3 above allowed identification of an EST 
cluster sequence from the LIFESEQ® database, designated Incyte cluster no. 141736. This EST cluster sequence 
was then compared to a variety of expressed sequence tag (EST) databases which included public EST databases 
(e.g., GenBank) and a proprietary EST DNA database (LIFESEQ®, Incyte Pharmaceuticals, Palo Alto, CA) to 
identify existing homologies. One or more of the ESTs was derived from a human mast cell line from a patient 
with mast cell leukemia. The homology search was performed using the computer program BLAST or BLAST2 
(Altshul et al., Methods in Enzymology 266:460-480 (1996)). Those comparisons resulting in a BLAST score 
of 70 (or in some cases 90) or greater that did not encode known proteins were clustered and assembled into a 
consensus DNA sequence with the program "phrap" (Phil Green, University of Washington, Seattle, 
Washington). The consensus sequence obtained therefrom is herein designated DNA56094. 

In light of an observed sequence homology between the DNA56094 consensus sequence and an EST 
sequence encompassed within the Incyte EST clone no. 004974, the Incyte EST clone 004974 was purchased 
and the cDNA insert was obtained and sequenced. It was found that this insert encoded a full-length protein. 
The sequence of this cDNA insert is shown in Figure 183 and is herein designated as DNA59214-1449 (SEQ 
ID NO:265). 

The full length clone shown in Figure 183 contained a single open reading frame with an apparent 
translational initiation site at nucleotide positions 42-44 and ending at the stop codon found at nucleotide positions 
414-416 (Figure 183; SEQ ID NO:265). The predicted polypeptide precursor (Figure 184, SEQ ID NO:266) 
is 124 amino acids long. PRO1068 has a calculated molecular weight of approximately 14,284 daltons and an 
estimated pI of approximately 8.14. The PRO1068 polypeptide has the following additional features, as 
indicated in Figure 184: a signal peptide sequence at about amino acids 1-20, a urotensin II signature sequence 
at about amino acids 118-123, a cell attachment sequence at about amino acids 64-66, and a potential cAMP- 
and cGMP-dependent protein kinase phosphorylation site at about amino acids 112-115. 
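
Features of this kind (N-glycosylation sites, kinase phosphorylation sites, a cell attachment sequence) are conventionally located by scanning the amino acid sequence for short consensus patterns. The sketch below illustrates such a scan with regular expressions. It is an assumption that patterns of this form were used here; the pattern set follows widely used PROSITE-style consensus motifs, and the names `MOTIFS`, `scan` and the toy sequence are hypothetical and not taken from the disclosure.

```python
import re

# PROSITE-style consensus patterns written as regular expressions (assumed set).
MOTIFS = {
    "N-glycosylation": r"N[^P][ST][^P]",
    "cAMP/cGMP-dependent kinase phosphorylation": r"[RK]{2}.[ST]",
    "protein kinase C phosphorylation": r"[ST].[RK]",
    "cell attachment (RGD)": r"RGD",
}


def scan(sequence):
    """Return (motif name, start, end) tuples using 1-based inclusive positions."""
    hits = []
    for name, pattern in MOTIFS.items():
        # A lookahead with a capture group also reports overlapping matches.
        for m in re.finditer(f"(?=({pattern}))", sequence):
            start = m.start() + 1
            end = m.start() + len(m.group(1))
            hits.append((name, start, end))
    return hits


# Toy sequence for illustration only; it is not any PRO polypeptide.
toy = "MKRGDLNVSAKRRASGTLK"
for name, start, end in scan(toy):
    print(f"{name}: about amino acids {start}-{end}")
```

Short patterns such as these match by chance fairly often, which is consistent with the text describing such sites as "potential".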

An analysis of the Dayhoff database (version 35.45 SwissProt 35), using a WU-BLAST2 sequence 
alignment analysis of the full-length sequence shown in Figure 184 (SEQ ID NO:266), revealed homology 
between the PRO1068 amino acid sequence and the following Dayhoff sequences: HALBOPJ, MTV043J6, 
150498, and P_R78445. 

Clone DNA59214-1449 was deposited with the ATCC on July 1, 1998 and is assigned ATCC deposit 
no. 203046. 

EXAMPLE 81: Isolation of cDNA clones Encoding Human PRO1066 

Use of the signal sequence algorithm described in Example 3 above allowed identification of a single 
Incyte EST cluster sequence designated herein as 79066. The Incyte EST cluster sequence no. 79066 sequence 
was then compared to a variety of expressed sequence tag (EST) databases which included public EST databases 
(e.g., GenBank) and a proprietary EST DNA database (LIFESEQ™, Incyte Pharmaceuticals, Palo Alto, CA) 
to identify existing homologies. The homology search was performed using the computer program BLAST or 
BLAST2 (Altshul et al., Methods in Enzymology 266:460-480 (1996)). Those comparisons resulting in a 
BLAST score of 70 (or in some cases 90) or greater that did not encode known proteins were clustered and 
assembled into a consensus DNA sequence with the program "phrap" (Phil Green, University of Washington, 
Seattle, Washington). The consensus sequence obtained therefrom is herein designated DNA56121. 

In light of an observed sequence homology between the DNA56121 consensus sequence and an EST 
sequence encompassed within the Incyte EST clone no. 1515315, the Incyte EST clone 1515315 was purchased 
and the cDNA insert was obtained and sequenced. It was found that this insert encoded a full-length protein. 
The sequence of this cDNA insert is shown in Figure 185 and is herein designated DNA59215-1425. 

The full length clone shown in Figure 185 contained a single open reading frame with an apparent 
translational initiation site at nucleotide positions 176-178 and ending at the stop codon found at nucleotide 
positions 527-529 (Figure 185; SEQ ID NO:267). The predicted polypeptide precursor is 117 amino acids long, 
has a calculated molecular weight of approximately 12,911 daltons and an estimated pI of approximately 5.46. 
Analysis of the full-length PRO1066 sequence shown in Figure 186 (SEQ ID NO:268) evidences the presence 
of the following: a signal peptide from about amino acid 1 to about amino acid 23, a cAMP- and cGMP- 
dependent protein kinase phosphorylation site from about amino acid 38 to about amino acid 41 and potential 
N-myristoylation sites from about amino acid 5 to about amino acid 10, from about amino acid 63 to about amino 
acid 68 and from about amino acid 83 to about amino acid 88. Clone UNQ524 (DNA59215-1425) has been 
deposited with the ATCC on June 9, 1998 and is assigned ATCC deposit no. 209961. 

Analysis of the amino acid sequence of the full-length PRO1066 polypeptide suggests that it does not 
possess significant sequence similarity to any known human protein. However, an analysis of the Dayhoff 
database (version 35.45 SwissProt 35) evidenced some degree of homology between the PRO1066 amino acid 
sequence and the following Dayhoff sequences, MOTTHUMAN, AF025667J, MTCY19H9_8 and 
RABIGKCHJ. 

EXAMPLE 82: Isolation of cDNA Clones Encoding Human PRO1184 

Use of the signal sequence algorithm described in Example 3 on ESTs from an Incyte database allowed 
identification of a candidate sequence designated herein as DNA56375. This sequence was then compared to a 
variety of expressed sequence tag (EST) databases which included public EST databases (e.g., GenBank) and 
a proprietary EST DNA database (LIFESEQ™, Incyte Pharmaceuticals, Palo Alto, CA) to identify existing 
homologies. The homology search was performed using the computer program BLAST or BLAST2 (Altshul 
et al., Methods in Enzymology 266:460-480 (1996)). Those comparisons resulting in a BLAST score of 70 (or 
in some cases 90) or greater that did not encode known proteins were clustered and assembled into a consensus 
DNA sequence with the program "phrap" (Phil Green, University of Washington, Seattle, Washington). The 
consensus sequence obtained therefrom is herein designated DNA56375. 

In light of an observed sequence homology between the DNA56375 consensus sequence and an EST 
sequence encompassed within the Incyte EST clone no. 1428374, the Incyte EST clone 1428374 was purchased 
and the cDNA insert was obtained and sequenced. It was found that this insert encoded a full-length protein. 
The sequence of this cDNA insert is shown in Figure 187. 

The full length clone shown in Figure 187 contained a single open reading frame with an apparent 
translational initiation site at nucleotide positions 106-108 and ending at the stop codon found at nucleotide 
positions 532-534 (Figure 187; SEQ ID NO:269). The predicted polypeptide precursor is 142 amino acids long, 
has a calculated molecular weight of approximately 15,690 daltons and an estimated pI of approximately 9.64. 
Analysis of the full-length PRO1184 sequence shown in Figure 188 (SEQ ID NO:270) evidences the presence 
of a signal peptide at about amino acids 1-38. Clone DNA59220-1514 has been deposited with the ATCC on 
June 9, 1998. It is understood that the deposited clone has the actual sequences and that representations are 
presented herein. 

Analysis of the amino acid sequence of the full-length PRO1184 polypeptide suggests that it possesses 
some sequence identity with a protein called TIM from Drosophila virilis, designated "DVTIMS021" in the 
Dayhoff database (version 35.45 SwissProt 35). Other 
Dayhoff database (version 35.45 SwissProt 35) sequences having some degree of sequence identity with 
PRO1184 include: WIS1_SCHPO, F002186J, ATAC00239124 and MSAIPRPJ. 

EXAMPLE 83: Isolation of cDNA clones Encoding Human PRO1360 

Use of the signal sequence algorithm described in Example 3 above allowed identification of an EST 
sequence from an Incyte database, designated DNA10572. This EST sequence was then compared to a variety 
of expressed sequence tag (EST) databases which included public EST databases (e.g., GenBank, Merck/Wash. 
U.) and a proprietary EST DNA database (LIFESEQ®, Incyte Pharmaceuticals, Palo Alto, CA) to identify 
existing homologies. The homology search was performed using the computer program BLAST or BLAST2 
(Altshul et al., Methods in Enzymology 266:460-480 (1996)). Those comparisons resulting in a BLAST score 
of 70 (or in some cases 90) or greater that did not encode known proteins were clustered and assembled into a 
consensus DNA sequence with the program "phrap" (Phil Green, University of Washington, Seattle, 
Washington). The consensus sequence obtained therefrom is herein designated DNA57314. 

In light of an observed sequence homology between the DNA57314 consensus sequence and an EST 
sequence encompassed within the Merck EST clone no. AA406443, the Merck EST clone AA406443 was 
purchased and the cDNA insert was obtained and sequenced. It was found that this insert encoded a full-length 
protein. The sequence of this cDNA insert is shown in Figure 189 and is herein designated as DNA59488-1603. 

The full length clone shown in Figure 189 contained a single open reading frame with an apparent 
translational initiation site at nucleotide positions 54-56 and ending at the stop codon found at nucleotide positions 
909-911 (Figure 189; SEQ ID NO:271). The predicted polypeptide precursor (Figure 190, SEQ ID NO:272) 
is 285 amino acids long. PRO1360 has a calculated molecular weight of approximately 31,433 daltons and an 
estimated pI of approximately 7.32. Clone DNA59488-1603 was deposited with the ATCC on August 25, 1998 
and is assigned ATCC deposit no. 203157. 
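
A calculated molecular weight of this kind is commonly obtained by summing average residue masses over the amino acid sequence and adding one water molecule for the free termini. The sketch below shows that calculation. It is an illustration only: the source does not state which mass table or software was used, the residue masses are standard average values supplied here as an assumption, and the function name `molecular_weight` is hypothetical.

```python
# Average residue masses in daltons (standard values; an assumption of this sketch).
RESIDUE_MASS = {
    "G": 57.0519, "A": 71.0788, "S": 87.0782, "P": 97.1167, "V": 99.1326,
    "T": 101.1051, "C": 103.1388, "L": 113.1594, "I": 113.1594, "N": 114.1038,
    "D": 115.0886, "Q": 128.1307, "K": 128.1741, "E": 129.1155, "M": 131.1926,
    "H": 137.1411, "F": 147.1766, "R": 156.1875, "Y": 163.1760, "W": 186.2132,
}
WATER = 18.01528  # mass of one water molecule, added for the free N- and C-termini


def molecular_weight(sequence):
    """Estimate the average molecular weight of an unmodified polypeptide."""
    try:
        return sum(RESIDUE_MASS[aa] for aa in sequence.upper()) + WATER
    except KeyError as exc:
        raise ValueError(f"unknown residue: {exc.args[0]}") from None


# Toy dipeptide for illustration only; it is not any PRO polypeptide.
print(round(molecular_weight("GA"), 2))
```

Applied to a full-length precursor sequence, a calculation of this form yields values on the scale reported in these examples, although the exact figure depends on the mass table used and on whether the signal peptide is included.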

An analysis of the Dayhoff database (version 35.45 SwissProt 35), using a WU-BLAST2 sequence 
alignment analysis of the full-length sequence shown in Figure 190 (SEQ ID NO:272), revealed sequence identity 
between the PRO1360 amino acid sequence and the following Dayhoff sequences: UN51_CAEEL, 
YD4B_SCHPO, AF000634J, GFO_ZYMMO, YEIJSCHPO, D86566J, ZMGFOJ, S76976, 
PPSA_SYNY3, and CEF28B1_4. 

EXAMPLE 84: Isolation of cDNA clones Encoding Human PRO1029 

Use of the signal sequence algorithm described in Example 3 above allowed identification of an EST 
cluster sequence from the Incyte database, designated 18763. This EST cluster sequence was then compared 
to a variety of expressed sequence tag (EST) databases which included public EST databases (e.g., GenBank) 
and a proprietary EST DNA database (LIFESEQ®, Incyte Pharmaceuticals, Palo Alto, CA) to identify existing 
homologies. The homology search was performed using the computer program BLAST or BLAST2 (Altshul 
et al., Methods in Enzymology 266:460-480 (1996)). Those comparisons resulting in a BLAST score of 70 (or 
in some cases 90) or greater that did not encode known proteins were clustered and assembled into a consensus 
DNA sequence with the program "phrap" (Phil Green, University of Washington, Seattle, Washington). The 
consensus sequence obtained therefrom is herein designated DNA57854. 

In light of an observed sequence homology between the DNA57854 consensus sequence and an EST 
sequence encompassed within the Merck EST clone no. T98880, the Merck EST clone T98880 was purchased 
and the cDNA insert was obtained and sequenced. It was found that this insert encoded a full-length protein. 
The sequence of this cDNA insert is shown in Figure 191 and is herein designated as DNA59493-1420. 

Clone DNA59493-1420 contains a single open reading frame with an apparent translational initiation 
site at nucleotide positions 39-41 and ending at the stop codon at nucleotide positions 297-299 (Figure 191). The 
predicted polypeptide precursor is 86 amino acids long (Figure 192). The full-length PRO1029 protein shown 
in Figure 192 has an estimated molecular weight of about 9,548 daltons and a pI of about 8.52. Analysis of the 
full-length PRO1029 sequence shown in Figure 192 (SEQ ID NO:274) evidences the presence of the following: 
a signal peptide from about amino acid 1 to about amino acid 19, an amino acid block having homology to 
bacterial rhodopsins retinal binding site protein from about amino acid 50 to about amino acid 61, a prenyl group 
binding site from about amino acid 83 to about amino acid 86 and a potential N-glycosylation site from about 
amino acid 45 to about amino acid 48. Clone DNA59493-1420 has been deposited with ATCC on July 1, 1998 
and is assigned ATCC deposit no. 203050. 

An analysis of the Dayhoff database (version 35.45 SwissProt 35), using a WU-BLAST2 sequence 
alignment analysis of the full-length sequence shown in Figure 192 (SEQ ID NO:274), evidenced significant 
homology between the PRO1029 amino acid sequence and the following Dayhoff sequences: S66088, 
AF031815J, MM4A6L_1, PSEIS52a-l, S17699 and P_R63635. 

EXAMPLE 85: Isolation of cDNA clones Encoding Human PRO1139 

Use of the signal sequence algorithm described in Example 3 above allowed identification of an EST 
cluster sequence from the Incyte database, designated 4461. This EST cluster sequence was then compared to 
a variety of expressed sequence tag (EST) databases which included public EST databases (e.g., GenBank) and 
a proprietary EST DNA database (LIFESEQ®, Incyte Pharmaceuticals, Palo Alto, CA) to identify existing 
homologies. The homology search was performed using the computer program BLAST or BLAST2 (Altshul 
et al., Methods in Enzymology 266:460-480 (1996)). Those comparisons resulting in a BLAST score of 70 (or 
in some cases 90) or greater that did not encode known proteins were clustered and assembled into a consensus 
DNA sequence with the program "phrap" (Phil Green, University of Washington, Seattle, Washington). The 
consensus sequence obtained therefrom is herein designated DNA57312. 

The DNA57312 consensus sequence included a 172-nucleotide-long public EST (T62095, 
Merck/University of Washington public database). This EST clone, identified herein as a putative protein coding 
sequence, was purchased from Merck, and sequenced to provide the coding sequence of PRO1139 (Figure 193). 
As noted before, the deduced amino acid sequence of DNA59497-1496 shows a significant sequence identity with 
the deduced amino acid sequence of HSOBRGRPJ. The full-length protein (Figure 194) contains a putative 
signal peptide between amino acid residues 1 and about 28, and three putative transmembrane domains 
(approximate amino acid residues 33-52, 71-89, 98-120). 


EXAMPLE 86: Isolation of cDNA clones Encoding Human PRO1309 

An expressed sequence tag (EST) DNA database (LIFESEQ®, Incyte Pharmaceuticals, Palo Alto, CA) 
was searched and an EST was identified which showed homology to SLIT. 

RNA for construction of cDNA libraries was isolated from human fetal brain tissue. The cDNA 
libraries used to isolate the cDNA clones encoding human PRO1309 were constructed by standard methods using 
commercially available reagents such as those from Invitrogen, San Diego, CA. The cDNA was primed with 
oligo dT containing a NotI site, linked with blunt to SalI hemikinased adaptors, cleaved with NotI, sized 
appropriately by gel electrophoresis, and cloned in a defined orientation into a suitable cloning vector (such as 
pRKB or pRKD; pRK5B is a precursor of pRK5D that does not contain the SfiI site; see, Holmes et al., Science, 
253:1278-1280 (1991)) in the unique XhoI and NotI sites. 

The cDNA libraries (prepared as described above), were screened by hybridization with a synthetic 
oligonucleotide probe derived from the above described Incyte EST sequence: 

5'-TCCGTGCAGGGGGACGCCrTTCAGAAACTGCGCCGAGTTAAGGAAC-3' (SEQ ID NO:279). 

A cDNA clone was isolated and sequenced in entirety. The entire nucleotide sequence of DNA59588-1571 
is shown in Figure 195 (SEQ ID NO:277). Clone DNA59588-1571 contains a single open reading frame 
with an apparent translational initiation site at nucleotide positions 720-722 and a stop codon at nucleotide 
positions 2286-2288 (Figure 195; SEQ ID NO:277). The predicted polypeptide precursor is 522 amino acids 
long. The signal peptide is approximately at 1-34 and the transmembrane domain is at approximately 428-450 
of SEQ ID NO:278. Clone DNA59588-1571 has been deposited with ATCC and is assigned ATCC deposit no. 
203106. The full-length PRO1309 protein shown in Figure 196 has an estimated molecular weight of about 
58,614 daltons and a pI of about 7.42. 

An analysis of the Dayhoff database (version 35.45 SwissProt 35), using a WU-BLAST2 sequence 
alignment analysis of the full-length sequence shown in Figure 196 (SEQ ID NO:278), revealed sequence identity 
between the PRO1309 amino acid sequence and the following Dayhoff sequences: AB007876_1, GPV_MOUSE, 
ALS_RAT, P_R85889, LUM_CHICK, AB014462J, PGS1_CANFA, CEM88J7, A58532 and GEN11209. 

EXAMPLE 87: Isolation of cDNA Clones Encoding Human PRO1028 

Use of the signal sequence algorithm described in Example 3 above allowed identification of a certain 
EST cluster sequence from the Incyte database. This EST cluster sequence was then compared to a variety of 
expressed sequence tag (EST) databases which included public EST databases (e.g., GenBank) and a proprietary 
EST DNA database (LIFESEQ®, Incyte Pharmaceuticals, Palo Alto, CA) to identify existing homologies. The 
homology search was performed using the computer program BLAST or BLAST2 (Altshul et al., Methods in 
Enzymology 266:460-480 (1996)). Those comparisons resulting in a BLAST score of 70 (or in some cases 90) 
or greater that did not encode known proteins were clustered and assembled into a consensus DNA sequence with 
the program "phrap" (Phil Green, University of Washington, Seattle, Washington). The consensus sequence 
obtained therefrom is herein designated DNA59603. 

In light of an observed sequence homology between the DNA59603 sequence and an EST sequence 
contained within Incyte EST clone no. 1497725, the Incyte EST clone no. 1497725 was purchased and the cDNA 
insert was obtained and sequenced. It was found that the insert encoded a full-length protein. The sequence of 
this cDNA insert is shown in Figure 197 and is herein designated as DNA59603-1419. 

The entire nucleotide sequence of DNA59603-1419 is shown in Figure 197 (SEQ ID NO:280). Clone 
DNA59603-1419 contains a single open reading frame with an apparent translational initiation site at nucleotide 
positions 21-23 and ending at the stop codon at nucleotide positions 612-614 (Figure 197). The predicted 
polypeptide precursor is 197 amino acids long (Figure 198). The full-length PRO1028 protein shown in Figure 
198 has an estimated molecular weight of about 20,832 daltons and a pI of about 8.74. Clone DNA59603-1419 
has been deposited with the ATCC. Regarding the sequence, it is understood that the deposited clone contains 
the correct sequence, and the sequences provided herein are based on known sequencing techniques. 

Analyzing the amino acid sequence of SEQ ID NO:281, the putative signal peptide is at about amino 
acids 1-19 of SEQ ID NO:281. An N-glycosylation site is at about amino acids 35-38 of SEQ ID NO:281. A 
C-type lectin domain is at about amino acids 108-117 of SEQ ID NO:281, indicating that PRO513 may be related 
to or be a lectin. The corresponding nucleotides of these amino acid sequences or others can be routinely 
determined given the sequences provided herein. 

EXAMPLE 88: Isolation of cDNA Clones Encoding Human PRO1027 

Use of the signal sequence algorithm described in Example 3 above allowed identification of a certain 
EST cluster sequence from the Incyte database. This EST cluster sequence was then compared to a variety of 
expressed sequence tag (EST) databases which included public EST databases (e.g., GenBank) and a proprietary 
EST DNA database (LIFESEQ®, Incyte Pharmaceuticals, Palo Alto, CA) to identify existing homologies. The 
homology search was performed using the computer program BLAST or BLAST2 (Altshul et al., Methods in 
Enzymology 266:460-480 (1996)). Those comparisons resulting in a BLAST score of 70 (or in some cases 90) 
or greater that did not encode known proteins were clustered and assembled into a consensus DNA sequence with 
the program "phrap" (Phil Green, University of Washington, Seattle, Washington). The consensus sequence 
obtained therefrom is herein designated DNA56399. 
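
The selection step described above (retaining comparisons with a BLAST score of 70, or in some cases 90, or greater that did not encode known proteins) may be sketched as follows. This is an illustrative assumption only: the `Hit` record, its field names and the example identifiers are hypothetical, and the sketch does not reproduce BLAST or phrap themselves, only the thresholding applied to their results before assembly.

```python
from dataclasses import dataclass


@dataclass
class Hit:
    est_id: str                  # hypothetical identifier of the matching EST
    score: float                 # BLAST score reported for the comparison
    encodes_known_protein: bool  # True if the hit corresponds to a known protein


def select_for_assembly(hits: list[Hit], min_score: float = 70.0) -> list[Hit]:
    """Keep hits at or above the score threshold that do not encode known proteins."""
    return [h for h in hits if h.score >= min_score and not h.encodes_known_protein]


# Made-up example data, for illustration only.
example_hits = [
    Hit("est_a", 95.0, False),
    Hit("est_b", 72.0, True),
    Hit("est_c", 65.0, False),
    Hit("est_d", 70.0, False),
]

selected = select_for_assembly(example_hits)
print([h.est_id for h in selected])                                        # ['est_a', 'est_d']
print([h.est_id for h in select_for_assembly(example_hits, min_score=90.0)])  # ['est_a']
```

The retained sequences would then be passed to an assembler such as phrap to produce the consensus sequence.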

In light of an observed sequence homology between the DNA56399 sequence and an EST sequence
contained within Incyte EST clone no. 937605, the Incyte EST clone no. 937605 was purchased and the cDNA
insert was obtained and sequenced. It was found that the insert encoded a full-length protein. The sequence of
this cDNA insert is shown in Figure 199 and is herein designated as DNA59605-1418.

The entire nucleotide sequence of DNA59605-1418 is shown in Figure 199 (SEQ ID NO:282). Clone 
DNA59605-1418 contains a single open reading frame with an apparent translational initiation site at nucleotide
positions 31-33 and ending at the stop codon at nucleotide positions 262-264 (Figure 199). The predicted
polypeptide precursor is 77 amino acids long (Figure 200). The full-length PRO1027 protein shown in Figure
200 has an estimated molecular weight of about 8,772 daltons and a pI of about 9.62. Clone DNA59605-1418
has been deposited with the ATCC. Regarding the sequence, it is understood that the deposited clone contains
the correct sequence, and the sequences provided herein are based on known sequencing techniques.

Analyzing the amino acid sequence of SEQ ID NO: 283, the putative signal peptide is at about amino 
acids 1-33 of SEQ ID NO:283. The type II fibronectin collagen-binding domain begins at about amino acid 30 
of SEQ ID NO:283. The corresponding nucleotides for these amino acid sequences and others can be routinely 
determined given the sequences provided herein. PRO1027 may be involved in tissue formation or repair. 
The following Dayhoff designations appear to have some sequence identity with PRO1027:

SFT2YEAST; ATM3E9_2; A69826; YM16_MARPO; E64896; U60193_2; MTLRAJ205 1; MCU60315_70; 
SPAS SHTFL; and S54213. 



EXAMPLE 89: Isolation of cDNA Clones Encoding Human PRO1107

Use of the signal sequence algorithm described in Example 3 above allowed identification of a certain
EST cluster sequence from the Incyte database. This EST cluster sequence was then compared to a variety of
expressed sequence tag (EST) databases which included public EST databases (e.g., GenBank) and a proprietary
EST DNA database (LIFESEQ®, Incyte Pharmaceuticals, Palo Alto, CA) to identify existing homologies. The
homology search was performed using the computer program BLAST or BLAST2 (Altshul et al., Methods in
Enzymology 266:460-480 (1996)). Those comparisons resulting in a BLAST score of 70 (or in some cases 90)
or greater that did not encode known proteins were clustered and assembled into a consensus DNA sequence with
the program "phrap" (Phil Green, University of Washington, Seattle, Washington). The consensus sequence
obtained therefrom is herein designated DNA56402.

In light of an observed sequence homology between the DNA56402 sequence and an EST sequence 
contained within Incyte EST clone no. 3203694, the Incyte EST clone no. 3203694 was purchased and the cDNA
insert was obtained and sequenced. It was found that the insert encoded a full-length protein. The sequence of
this cDNA insert is shown in Figure 201 and is herein designated as DNA59606-1471.

The entire nucleotide sequence of DNA59606-1471 is shown in Figure 201 (SEQ ID NO:284). Clone
DNA59606-1471 contains a single open reading frame with an apparent translational initiation site at nucleotide
positions 244-246 and ending at the stop codon at nucleotide positions 1675-1677 of SEQ ID NO:284 (Figure
201). The predicted polypeptide precursor is 477 amino acids long (Figure 202). The full-length PRO1107
protein shown in Figure 202 has an estimated molecular weight of about 54,668 daltons and a pI of about 6.33.
Clone DNA59606-1471 has been deposited with ATCC on June 9, 1998. It is understood that the deposited
clone has the actual nucleic acid sequence and that the sequences provided herein are based on known
sequencing techniques.

Analysis of the amino acid sequence of the full-length PRO1107 polypeptide suggests that it possesses
significant sequence similarity to phosphodiesterase I/nucleotide pyrophosphatase, human insulin receptor
tyrosine kinase inhibitor, alkaline phosphodiesterase and autotaxin, thereby indicating that PRO1107 may have
at least one or all of the activities of these proteins, and that PRO1107 is a novel phosphodiesterase. More
specifically, an analysis of the Dayhoff database (version 35.45 SwissProt 35) evidenced sequence identity
between the PRO1107 amino acid sequence and at least the following Dayhoff sequences: AF005632_1,
P_R79148, RNU78787J, AF060218_4, A57080 and HUMATXTJ.

EXAMPLE 90: Isolation of cDNA clones Encoding Human PRO1140

Use of the signal sequence algorithm described in Example 3 above allowed identification of a single
Incyte EST sequence, Incyte cluster sequence No. 135917. This sequence was then compared to a variety of
expressed sequence tag (EST) databases which included public EST databases (e.g., GenBank) and a proprietary
EST DNA database (LIFESEQ™, Incyte Pharmaceuticals, Palo Alto, CA) to identify existing homologies. The
homology search was performed using the computer program BLAST or BLAST2 (Altshul et al., Methods in
Enzymology 266:460-480 (1996)). Those comparisons resulting in a BLAST score of 70 (or in some cases 90)
or greater that did not encode known proteins were clustered and assembled into a consensus DNA sequence with
the program "phrap" (Phil Green, Univ. of Washington, Seattle, Washington). The consensus sequence obtained
therefrom is herein designated DNA56416.

In light of an observed sequence homology between DNA56416 and an EST sequence contained within
Incyte EST clone no. 3345705, Incyte EST clone no. 3345705 was obtained and its insert sequenced. It was
found that the insert encoded a full-length protein. The sequence, designated herein as DNA59607-1497, which
is shown in Figure 203, is the full-length DNA sequence for PRO1140. Clone DNA59607-1497 was deposited
with the ATCC on June 9, 1998, and is assigned ATCC deposit no. 209946.

The entire nucleotide sequence of DNA59607-1497 is shown in Figure 203 (SEQ ID NO:286). Clone
DNA59607-1497 contains a single open reading frame with an apparent translational initiation site at nucleotide
positions 210-212 and ending at the stop codon at nucleotide positions 975-977 (Figure 203). The predicted
polypeptide precursor is 255 amino acids long (Figure 204). The full-length PRO1140 protein shown in Figure
204 has an estimated molecular weight of about 29,405 daltons and a pI of about 7.64. Analysis of the
full-length PRO1140 sequence shown in Figure 204 (SEQ ID NO:287) evidences the presence of three
transmembrane domains at about amino acids 101 to 118, 141 to 161 and 172 to 191.

An analysis of the amino acid sequence using the Dayhoff database
(version 35.45 SwissProt 35) evidenced homology between the PRO1140 amino acid sequence and the following
Dayhoff sequences: AF023602_1, AF000368_1, CIN3_RAT, AF003373_1, GEN13279, and AF003372J.

Clone DNA59607-1497 was deposited with the ATCC on June 9, 1998, and is assigned ATCC deposit
no. 209946.



EXAMPLE 91: Isolation of cDNA clones Encoding Human PRO1106

Use of the signal sequence algorithm described in Example 3 above allowed identification of a single
Incyte EST sequence. This sequence was then compared to a variety of expressed sequence tag (EST) databases
which included public EST databases (e.g., GenBank) and a proprietary EST DNA database (LIFESEQ™, Incyte
Pharmaceuticals, Palo Alto, CA) to identify existing homologies. The homology search was performed using
the computer program BLAST or BLAST2 (Altshul et al., Methods in Enzymology 266:460-480 (1996)). Those
comparisons resulting in a BLAST score of 70 (or in some cases 90) or greater that did not encode known
proteins were clustered and assembled into a consensus DNA sequence with the program "phrap" (Phil Green,
Univ. of Washington, Seattle, Washington). The consensus sequence obtained therefrom is herein designated
DNA56423.

In light of an observed sequence homology between DNA56423 and an EST sequence contained within
Incyte EST clone no. 1711247, Incyte EST clone no. 1711247 was obtained and its insert sequenced. It was
found that the insert encoded a full-length protein. The sequence, designated herein as DNA59609-1470, which
is shown in Figure 205, is the full-length DNA sequence for PRO1106. Clone DNA59609-1470 was deposited
with the ATCC on June 9, 1998, and is assigned ATCC deposit no. 209963.

The entire nucleotide sequence of DNA59609-1470 is shown in Figure 205 (SEQ ID NO:288). Clone 
DNA59609-1470 contains a single open reading frame with an apparent translational initiation site at nucleotide
positions [...] and ending at the stop codon at nucleotide positions [...].
The predicted polypeptide precursor is 469 amino acids long (Figure 206). The full-length PRO1106 protein
shown in Figure 206 has an estimated molecular weight of about 52,689 daltons and a pI of about 8.68. It is
understood that the skilled artisan can construct the polypeptide or nucleic acid encoding therefor to exclude any
one or more of all of these domains. For example, the transmembrane domain region(s) and/or either of the
amino terminal or carboxyl end can be excluded. Clone DNA59609-1470 has been deposited with ATCC on
June 9, 1998. It is understood that the deposited clone has the actual nucleic acid sequence and that the
sequences provided herein are based on known sequencing techniques.

Analysis of the amino acid sequence of the full-length PRO1106 polypeptide suggests that it possesses
significant sequence similarity to the peroxisomal Ca-dependent solute carrier, thereby indicating that PRO1106
may be a novel transporter. More specifically, an analysis of the Dayhoff database (version 35.45 SwissProt
35) evidenced sequence identity between the PRO1106 amino acid sequence and at least the following Dayhoff
sequences: AF004161J, IG002N01_25, GDCBOVIN and BT1_MAIZE.

EXAMPLE 92: Isolation of cDNA clones Encoding Human PRO1291

Use of the signal sequence algorithm described in Example 3 above allowed identification of an EST
cluster sequence from the Incyte database, designated 120480. This EST cluster sequence was then compared
to a variety of expressed sequence tag (EST) databases which included public EST databases (e.g., GenBank)
and a proprietary EST DNA database (Lifeseq®, Incyte Pharmaceuticals, Palo Alto, CA) to identify existing
homologies. The homology search was performed using the computer program BLAST or BLAST2 (Altshul
et al., Methods in Enzymology 266:460-480 (1996)). Those comparisons resulting in a BLAST score of 70 (or
in some cases 90) or greater that did not encode known proteins were clustered and assembled into a consensus
DNA sequence with the program "phrap" (Phil Green, University of Washington, Seattle, Washington). The
consensus sequence obtained therefrom is herein designated DNA56425.

In light of an observed sequence homology between the DNA56425 sequence and an EST sequence
encompassed within the Incyte EST clone no. 2798803, the Incyte EST clone 2798803 was purchased and the
cDNA insert was obtained and sequenced. It was found that this insert encoded a full-length protein. The
sequence of this cDNA insert is shown in Figure 207 and is herein designated as DNA59610-1556.

Clone DNA59610-1556 contains a single open reading frame with an apparent translational initiation
site at nucleotide positions 61-63 and ending at the stop codon at nucleotide positions 907-909 (Figure 207). The
predicted polypeptide precursor is 282 amino acids long (Figure 208). The full-length PRO1291 protein shown
in Figure 208 has an estimated molecular weight of about 30,878 daltons and a pI of about 5.27. Analysis of
the full-length PRO1291 sequence shown in Figure 208 (SEQ ID NO:291) evidences the presence of the
following: a signal peptide from about amino acid 1 to about amino acid 28, a transmembrane domain from about
amino acid 258 to about amino acid 281 and potential N-glycosylation sites from about amino acid 112 to about
amino acid 115, from about amino acid 160 to about amino acid 163, from about amino acid 190 to about amino
acid 193, from about amino acid 196 to about amino acid 199, from about amino acid 205 to about amino acid
208, from about amino acid 216 to about amino acid 219 and from about amino acid 220 to about amino acid
223. Clone DNA59610-1556 has been deposited with ATCC on June 16, 1998 and is assigned ATCC deposit
no. 209990.

An analysis of the Dayhoff database (version 35.45 SwissProt 35), using a WU-BLAST2 sequence
alignment analysis of the full-length sequence shown in Figure 208 (SEQ ID NO:291), evidenced significant
homology between the PRO1291 amino acid sequence and the following Dayhoff sequences: HSU90S52_1,
HSU90144J, AF033107J, HSB73J, HSU90142J, GGCD80J, PJV34452, MOG_MOUSE, B39371 and
P_R71360. 

EXAMPLE 93: Isolation of cDNA clones Encoding Human PRO1105

Use of the signal sequence algorithm described in Example 3 above allowed identification of an EST 
cluster sequence from the Incyte database. This EST cluster sequence was then compared to a variety of 
expressed sequence tag (EST) databases which included public EST databases (e.g., GenBank) and a proprietary
EST DNA database (Lifeseq®, Incyte Pharmaceuticals, Palo Alto, CA) to identify existing homologies. The
homology search was performed using the computer program BLAST or BLAST2 (Altshul et al., Methods in
Enzymology 266:460-480 (1996)). Those comparisons resulting in a BLAST score of 70 (or in some cases 90)
or greater that did not encode known proteins were clustered and assembled into a consensus DNA sequence with 
the program "phrap" (Phil Green, University of Washington, Seattle, Washington). The consensus sequence 
obtained therefrom is herein designated DNA56430. 

In light of an observed sequence homology between the DNA56430 sequence and an EST sequence
encompassed within the Incyte EST clone no. 1853047, the Incyte EST clone 1853047 was purchased and the
cDNA insert was obtained and sequenced. It was found that this insert encoded a full-length protein. The 
sequence of this cDNA insert is shown in Figure 209 and is herein designated as DNA59612-1466. 

The entire nucleotide sequence of DNA59612-1466 is shown in Figure 209 (SEQ ID NO:292). Clone
DNA59612-1466 contains a single open reading frame with an apparent translational initiation site at nucleotide
positions 28-30 and ending at the stop codon at nucleotide positions 568-570 of SEQ ID NO:292 (Figure 209).
The predicted polypeptide precursor is 180 amino acids long (Figure 210). The full-length PRO1105 protein
shown in Figure 210 has an estimated molecular weight of about 20,040 daltons and a pI of about 8.35. Clone
DNA59612-1466 has been deposited with the ATCC on June 9, 1998. It is understood that the deposited clone
has the actual nucleic acid sequence and that the sequences provided herein are based on known sequencing
techniques. 

Analyzing Figure 210, a signal peptide is at about amino acids 1-19 of SEQ ID NO:293 and 
transmembrane domains are shown at about amino acids 80-99 and 145-162 of SEQ ID NO:293. It is 
understood that the skilled artisan could form a polypeptide with all of or any combination or individual selection 
of these regions. It is also understood that the corresponding nucleic acids can be routinely identified and
prepared based on the information provided herein. 

EXAMPLE 94: Isolation of cDNA clones Encoding Human PRO511

Use of the signal sequence algorithm described in Example 3 above allowed identification of an EST
cluster sequence from the Incyte database. This EST cluster sequence was then compared to a variety of
expressed sequence tag (EST) databases which included public EST databases (e.g., GenBank) and a proprietary
EST DNA database (Lifeseq®, Incyte Pharmaceuticals, Palo Alto, CA) to identify existing homologies. The
homology search was performed using the computer program BLAST or BLAST2 (Altshul et al., Methods in
Enzymology 266:460-480 (1996)). Those comparisons resulting in a BLAST score of 70 (or in some cases 90)
or greater that did not encode known proteins were clustered and assembled into a consensus DNA sequence with
the program "phrap" (Phil Green, University of Washington, Seattle, Washington). The consensus sequence
obtained therefrom is herein designated DNA56434.

In light of an observed sequence homology between the DNA56434 sequence and an EST sequence
encompassed within the Incyte EST clone no. 1227491, the Incyte EST clone 1227491 was purchased and the
cDNA insert was obtained and sequenced. It was found that this insert encoded a full-length protein. The
sequence of this cDNA insert is shown in Figure 211 and is herein designated as DNA59613-1417.

The entire nucleotide sequence of DNA59613-1417 is shown in Figure 211 (SEQ ID NO:294). Clone
DNA59613-1417 contains a single open reading frame with an apparent translational initiation site at nucleotide
positions 233-235 and ending at the stop codon at nucleotide positions 944-946 (Figure 211). The predicted
polypeptide precursor is 237 amino acids long (Figure 212). The full-length PRO511 protein shown in Figure
212 has an estimated molecular weight of about 25,284 daltons and a pI of about 5.74. Clone DNA59613-1417
has been deposited with the ATCC. Regarding the sequence, it is understood that the deposited clone contains
the correct sequence, and the sequences provided herein are based on known sequencing techniques.

Analyzing the amino acid sequence of SEQ ID NO:295, the putative signal peptide is at about amino 
acids 1-25 of SEQ ID NO:295. The N-glycosylation sites are at about amino acids 45-48, 73-76, 107-110, 118-121,
132-135, 172-175, 175-178 and 185-188 of SEQ ID NO:295. An arthropod defensins conserved region
is at about amino acids 176-182 of SEQ ID NO:295. A kringle domain begins at about amino acid 128 of SEQ
ID NO:295 and a ly-6/u-PAR domain begins at about amino acid 6 of SEQ ID NO:295. The corresponding
nucleotides of these amino acid sequences and others can be routinely determined given the sequences provided 
herein. 

The designations appearing in a Dayhoff database with which PRO511 has some sequence identity are
as follows: SSC20F10J; SF041083; P_W26579; S44208; JC2394; F5TAJHCDI; A27020; S59310;
RAG1RABIT; and MUSBALBC1J.

EXAMPLE 95: Isolation of cDNA clones Encoding Human PRO1104

Use of the signal sequence algorithm described in Example 3 above allowed identification of an EST 
cluster sequence from the Incyte database. This EST cluster sequence was then compared to a variety of 
expressed sequence tag (EST) databases which included public EST databases (e.g., GenBank) and a proprietary
EST DNA database (Lifeseq®, Incyte Pharmaceuticals, Palo Alto, CA) to identify existing homologies. The
homology search was performed using the computer program BLAST or BLAST2 (Altshul et al., Methods in
Enzymology 266:460-480 (1996)). Those comparisons resulting in a BLAST score of 70 (or in some cases 90)
or greater that did not encode known proteins were clustered and assembled into a consensus DNA sequence with
the program "phrap" (Phil Green, University of Washington, Seattle, Washington). The consensus sequence
obtained therefrom is herein designated DNA56446. 

In light of an observed sequence homology between the DNA56446 sequence and an EST sequence 
encompassed within the Incyte EST clone no. 2837496, the Incyte EST clone 2837496 was purchased and the 
cDNA insert was obtained and sequenced. It was found that this insert encoded a full-length protein. The 
sequence of this cDNA insert is shown in Figure 213 and is herein designated as DNA59616-1465.

The entire nucleotide sequence of DNA59616-1465 is shown in Figure 213 (SEQ ID NO:296). Clone 
DNA59616-1465 contains a single open reading frame with an apparent translational initiation site at nucleotide
positions 109-111 and ending at the stop codon at nucleotide positions 1132-1134 of SEQ ID NO:296 (Figure
213). The predicted polypeptide precursor is 341 amino acids long (Figure 214). The full-length PRO1104
protein shown in Figure 214 has an estimated molecular weight of about 36,769 daltons and a pI of about 9.03.
Clone DNA59616-1465 has been deposited with ATCC on June 16, 1998. It is understood that the deposited
clone has the actual nucleic acid sequence and that the sequences provided herein are based on known
sequencing techniques.

Analyzing Figure 214, a signal peptide is [...] and N-myristoylation
sites are at about amino acids 41-46, 110-115, 133-138, 167-172 and 179-184 of SEQ ID
NO:297.

EXAMPLE 96: Isolation of cDNA clones Encoding Human PRO1100

Use of the signal sequence algorithm described in Example 3 above allowed identification of an EST 
cluster sequence from the Incyte database. This EST cluster sequence was then compared to a variety of
expressed sequence tag (EST) databases which included public EST databases (e.g., GenBank) and a proprietary
EST DNA database (Lifeseq®, Incyte Pharmaceuticals, Palo Alto, CA) to identify existing homologies. The
homology search was performed using the computer program BLAST or BLAST2 (Altshul et al., Methods in
Enzymology 266:460-480 (1996)). Those comparisons resulting in a BLAST score of 70 (or in some cases 90)
or greater that did not encode known proteins were clustered and assembled into a consensus DNA sequence with
the program "phrap" (Phil Green, University of Washington, Seattle, Washington).

In light of an observed sequence homology between the obtained consensus sequence and an EST 
sequence encompassed within the Incyte EST clone no. 2305379, the Incyte EST clone 2305379 was purchased
and the cDNA insert was obtained and sequenced. It was found that this insert encoded a full-length protein. 
The sequence of this cDNA insert is shown in Figure 215 and is herein designated as DNA59619-1464. 

The entire nucleotide sequence of DNA59619-1464 is shown in Figure 215 (SEQ ID NO:298). Clone 
DNA59619-1464 contains a single open reading frame with an apparent translational initiation site at nucleotide 
positions 33-35 and ending at the stop codon at nucleotide positions 993-995 of SEQ ID NO:298 (Figure 215).
The predicted polypeptide precursor is 320 amino acids long (Figure 216). The full-length PRO1100 protein
shown in Figure 216 has an estimated molecular weight of about 36,475 daltons and a pI of about 7.29. Clone
DNA59619-1464 has been deposited with ATCC on July 1, 1998. It is understood that the deposited clone has
the actual nucleic acid sequence and that the sequences provided herein are based on known sequencing
techniques.

Upon analyzing SEQ ID NO:299, the approximate locations of the signal peptide, the transmembrane 
domains, an N-glycosylation site, an N-myristoylation site, a CUB domain and an amiloride-sensitive sodium
channel domain are present. It is believed that PRO1100 may function as a channel. The corresponding nucleic
acids for these amino acids and others can be routinely determined given SEQ ID NO:299.

EXAMPLE 97: Isolation of cDNA clones Encoding Human PRO836

Use of the signal sequence algorithm described in Example 3 above allowed identification of an EST 
cluster sequence from the Incyte database. This EST cluster sequence was then compared to a variety of 
expressed sequence tag (EST) databases which included public EST databases (e.g., GenBank) and a proprietary 
EST DNA database (Lifeseq®, Incyte Pharmaceuticals, Palo Alto, CA) to identify existing homologies. The
homology search was performed using the computer program BLAST or BLAST2 (Altshul et al., Methods in
Enzymology 266:460-480 (1996)). Those comparisons resulting in a BLAST score of 70 (or in some cases 90)
or greater that did not encode known proteins were clustered and assembled into a consensus DNA sequence with
the program "phrap" (Phil Green, University of Washington, Seattle, Washington). The consensus sequence
obtained is herein designated DNA56453.

In light of an observed sequence homology between the DNA56453 consensus sequence and an EST
sequence encompassed within the Incyte EST clone no. 2610075, the Incyte EST clone 2610075 was purchased
and the cDNA insert was obtained and sequenced. It was found that this insert encoded a full-length protein. 
The sequence of this cDNA insert is shown in Figure 217 and is herein designated as DNA59620-1463. 

The entire nucleotide sequence of DNA59620-1463 is shown in Figure 217 (SEQ ID NO:300). Clone
DNA59620-1463 contains a single open reading frame with an apparent translational initiation site at nucleotide
positions 65-67 and ending at the stop codon at nucleotide positions 1448-1450 of SEQ ID NO:300 (Figure 217).
The predicted polypeptide precursor is 461 amino acids long (Figure 218). The full-length PRO836 protein
shown in Figure 218 has an estimated molecular weight of about 52,085 daltons and a pI of about 5.36. Analysis
of the full-length PRO836 sequence shown in Figure 218 (SEQ ID NO:301) evidences the presence of the
following: a signal peptide, N-glycosylation sites, N-myristoylation sites, a domain conserved in the
YJL126w/YLR351c/yhcX family of proteins, and a region having sequence identity with SLS1. Clone
DNA59620-1463 has been deposited with ATCC on June 16, 1998. It is understood that the deposited clone
has the actual nucleic acid sequence and that the sequences provided herein are based on known sequencing
techniques.

Analysis of the amino acid sequence of the full-length PRO836 polypeptide suggests that it possesses
some sequence similarity to SLS1, thereby indicating that PRO836 may be involved in protein translocation of
the ER. More specifically, an analysis of the Dayhoff database (version 35.45 SwissProt 35) evidenced some
homology between the PRO836 amino acid sequence and at least the following Dayhoff sequences: S58132,
SPBC3B9J, S66714, CRU40057J and IMACAEEL.

EXAMPLE 98: Isolation of cDNA clones Encoding Human PRO1141

Use of the signal sequence algorithm described in Example 3 above allowed identification of an EST 
cluster sequence from the Incyte database, designated 11873. This EST cluster sequence was then compared 
to a variety of expressed sequence tag (EST) databases which included public EST databases (e.g., GenBank) 
and a proprietary EST DNA database (LIFESEQ®, Incyte Pharmaceuticals, Palo Alto, CA) to identify existing
homologies. The homology search was performed using the computer program BLAST or BLAST2 (Altshul
et al., Methods in Enzymology 266:460-480 (1996)). Those comparisons resulting in a BLAST score of 70 (or
in some cases 90) or greater that did not encode known proteins were clustered and assembled into a consensus
DNA sequence with the program "phrap" (Phil Green, University of Washington, Seattle, Washington). The
consensus sequence obtained therefrom is herein designated DNA56518. 

In light of an observed sequence homology between the DNA56518 consensus sequence and an EST 
sequence encompassed within the Incyte EST clone no. 2679995, the Incyte EST clone 2679995 was purchased
and the cDNA insert was obtained and sequenced. It was found that this insert encoded a full-length protein.
The sequence of this cDNA insert is shown in Figure 219 and is herein designated as DNA59625-1498. 

Clone DNA59625-1498 contains a single open reading frame with an apparent translational initiation
site at nucleotide positions 204-206 and ending at the stop codon at nucleotide positions 945-947 (Figure 219).
The predicted polypeptide precursor is 247 amino acids long (Figure 220). The full-length PRO1141 protein
shown in Figure 220 has an estimated molecular weight of about [...] daltons and a pI of about [...]. Analysis
of the full-length PRO1141 sequence shown in Figure 220 (SEQ ID NO:303) evidences the presence of the
following: a signal peptide from about amino acid 1 to about amino acid 19 and transmembrane domains from
about amino acid 38 to about amino acid 57, from about amino acid 67 to about amino acid 83, from about
amino acid 117 to about amino acid 139 and from about amino acid 153 to about amino acid 170. Clone
DNA59625-1498 has been deposited with ATCC on June 16, 1998 and is assigned ATCC deposit no. 209992.

An analysis of the Dayhoff database (version 35.45 SwissProt 35), using a WU-BLAST2 sequence
alignment analysis of the full-length sequence shown in Figure 220 (SEQ ID NO:303), evidenced significant
homology between the PRO1141 amino acid sequence and the following Dayhoff sequences: CEVF36H2L2,
PCRB7PRJ__1, AB000506J, LEU95008J, MRU87980J5, YIGMECOL1, STU65700_1, GHU62778J,
CYST_SYNY3 and AF009567_1.

EXAMPLE 99: Isolation of cDNA clones Encoding Human PRO1132

A consensus DNA sequence was assembled relative to other EST sequences using phrap as described 
in Example 1 above. This consensus sequence is designated herein as DNA35934. Based on the DNA35934
consensus sequence, oligonucleotides were synthesized: 1) to identify by PCR a cDNA library that contained
the sequence of interest, and 2) for use as probes to isolate a clone of the full-length coding sequence for
PRO1132.

PCR primers (forward and reverse) were synthesized: 
forward PCR primer: 5'-TCCTGTOACCACCCCTCTAACACC-3' (SEQ ID NO:310) and
reverse PCR primer: 5'-CTGGAACATCTGCTGCCCAGATTC-3' (SEQ ID NO:311).

Additionally, a synthetic oligonucleotide hybridization probe was constructed from the consensus 
sequence which had the following nucleotide sequence: 

5'-GTCGGATGACAGCAGCAGCCGCATCATCAATGGATCCGACTGCGATATGC-3' (SEQ ID NO:312).

In order to screen several libraries for a source of a full-length clone, DNA from the libraries was
screened by PCR amplification with the PCR primer pair identified above. A positive library was then used to
isolate clones encoding the PRO1132 gene using the probe oligonucleotide and one of the PCR primers. RNA
for construction of the cDNA libraries was isolated from human fetal kidney. 
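
For illustration, basic properties of the recited oligonucleotides (length, GC content and reverse complement) can be computed as sketched below. This example is an assumption for explanatory purposes and not part of the disclosed method; the helper names `reverse_complement` and `gc_fraction` are hypothetical, and only the reverse PCR primer (SEQ ID NO:311) and the hybridization probe (SEQ ID NO:312) are used.

```python
# Translation table mapping each DNA base to its complement.
COMPLEMENT = str.maketrans("ACGT", "TGCA")


def reverse_complement(seq: str) -> str:
    """Return the reverse complement of an uppercase DNA sequence."""
    return seq.translate(COMPLEMENT)[::-1]


def gc_fraction(seq: str) -> float:
    """Return the fraction of bases in the sequence that are G or C."""
    return sum(base in "GC" for base in seq) / len(seq)


reverse_primer = "CTGGAACATCTGCTGCCCAGATTC"                        # SEQ ID NO:311
probe = "GTCGGATGACAGCAGCAGCCGCATCATCAATGGATCCGACTGCGATATGC"       # SEQ ID NO:312

print(len(reverse_primer), len(probe))
print(reverse_complement(reverse_primer))
print(round(gc_fraction(reverse_primer), 2))
```

The reverse complement of the reverse primer gives the sequence as it would read on the coding strand, which is the orientation in which it would be located within the consensus sequence.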

DNA sequencing of the clones isolated as described above gave the full-length DNA sequence for
PRO1132 and the derived protein sequence for PRO1132.

The entire nucleotide sequence of PRO1132 is shown in Figure 225 (SEQ ID NO:308). Clone
DNA59767-1489 contains a single open reading frame with an apparent translational initiation site at nucleotide
positions 354-356 and a stop codon at nucleotide positions 1233-1235 (Figure 225; SEQ ID NO:308). The
predicted polypeptide precursor is 293 amino acids long. The signal peptide is at about amino acids 1-22 and
the histidine active site is at about amino acids 104-109 of SEQ ID NO:309. Clone DNA59767-1489 has been
deposited with ATCC (having the actual sequence rather than representations based on sequencing techniques
as presented herein) and is assigned ATCC deposit no. 203108. The full-length PRO1132 protein shown in
Figure 226 has an estimated molecular weight of about 32,020 daltons and a pI of about 8.7.

An analysis of the Dayhoff database (version 35.45 SwissProt 35), using a WU-BLAST2 sequence
alignment analysis of the full-length sequence shown in Figure 226 (SEQ ID NO:309), revealed sequence identity
between the PRO1132 amino acid sequence and the following Dayhoff sequences: SSU76256_1, P_W10694,
MMAE0006636, AF013988J, U66061J, MMAE000665_2, MMAE00066415, MMAE00066414, 
MMAE000665_4 and MMAE00066412. 

15 

EXAMPLE 100: Isolation of cDNA clones Encoding Human NL7 (PRO1346) 

A single EST sequence (#1398422) was found in the LIFESEQ™ database as described in Example 1 
above. This EST sequence was renamed as DNA45668. Based on the DNA45668 sequence, oligonucleotides 
were synthesized: 1) to identify by PCR a cDNA library that contained the sequence of interest, and 2) for use 
as probes to isolate a clone of the full-length coding sequence for NL7. 

PCR primers (forward and reverse) were synthesized: 

forward PCR primer: 5'-CACACGTCCAACCTCAATGGGCAG-3' (SEQ ID NO:315) 

reverse PCR primer: 5'-GACCAGCAGGGCCAAGGACAAGG-3' (SEQ ID NO:316) 

Additionally, a synthetic oligonucleotide hybridization probe was constructed from the consensus 
DNA45668 sequence which had the following nucleotide sequence: 
hybridization probe: 

5 , <JTTCTCTGAGATGAAGATCCGGCCGGTCCGGGAGTACCGCTTAG-3• 
(SEQ ID NO:317) 

In order to screen several libraries for a source of a full-length clone, DNA from the libraries was 
screened by PCR amplification with the PCR primer pair identified above. A positive library was then used to 
isolate clones encoding the NL7 gene using the probe oligonucleotide and one of the PCR primers. RNA for 
construction of the cDNA libraries was isolated from a human fetal kidney library (UB227). 
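
Basic properties of the primer pair, such as length, GC content and the reverse complement (the strand to which each primer anneals), can be tabulated directly from the sequences listed above. The sketch below is illustrative only; the helper names `reverse_complement` and `gc_fraction` are hypothetical, and no annealing conditions are implied because none are stated in the text.

```python
# Translation table mapping each base to its Watson-Crick partner.
COMPLEMENT = str.maketrans("ACGT", "TGCA")


def reverse_complement(seq: str) -> str:
    """Return the reverse complement of a DNA sequence written 5' to 3'."""
    return seq.upper().translate(COMPLEMENT)[::-1]


def gc_fraction(seq: str) -> float:
    """Return the fraction of bases that are G or C."""
    seq = seq.upper()
    return (seq.count("G") + seq.count("C")) / len(seq)


forward = "CACACGTCCAACCTCAATGGGCAG"  # SEQ ID NO:315
reverse = "GACCAGCAGGGCCAAGGACAAGG"   # SEQ ID NO:316

for name, primer in (("forward", forward), ("reverse", reverse)):
    print(name, len(primer), round(gc_fraction(primer), 2), reverse_complement(primer))
```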

DNA sequencing of the clones isolated as described above gave the full-length DNA sequence for NL7 
(designated herein as DNA59776-1600 [Figure 227, SEQ ID NO:313]) and the derived protein sequence for NL7 
(PRO1346). 

The entire coding sequence of NL7 (PRO1346) is shown in Figure 227 (SEQ ID NO:313). Clone 
DNA59776-1600 contains a single open reading frame with an apparent translational initiation site at nucleotide 
positions 1-3 and an apparent stop codon at nucleotide positions 1384-1386. The predicted polypeptide precursor 
is 461 amino acids long. The protein contains an apparent type II transmembrane domain at amino acid positions 
from about 31 to about 50; fibrinogen beta and gamma chains C-terminal domain signature starting at about 
amino acid position 409, and a leucine zipper pattern starting at about amino acid positions 140, 147, 154 and 
161, respectively. Clone DNA59776-1600 has been deposited with ATCC and is assigned ATCC deposit no. 
203128. The full-length NL7 protein shown in Figure 228 has an estimated molecular weight of about 50,744 
daltons and a pI of about 6.38. 

Based on a WU-BLAST2 sequence alignment analysis (using the WU-BLAST2 computer program) of 
the full-length sequence, NL7 shows significant amino acid sequence identity to a human microfibril-associated 
glycoprotein (1 MF A4HUMAN); to known TIE-2 ligands and ligand homologues, ficolin, serum lectin and 
TGF-1 binding protein. 

EXAMPLE 101: Isolation of cDNA clones Encoding Human PRO1131 

A cDNA sequence isolated in the amylase screen described in Example 2 above is herein designated 
DNA43546 (see Figure 231; SEQ ID NO:320). The DNA43546 sequence was then compared to a variety of 
expressed sequence tag (EST) databases which included public EST databases (e.g., GenBank) and a proprietary 
EST DNA database (LIFESEQ™, Incyte Pharmaceuticals, Palo Alto, CA) to identify existing homologies. The 
homology search was performed using the computer program BLAST or BLAST2 (Altschul et al., Methods in 
Enzymology 266:460-480 (1996)). Those comparisons resulting in a BLAST score of 70 (or in some cases 90) 
or greater that did not encode known proteins were clustered and assembled into consensus DNA sequences with 
the program "phrap" (Phil Green, University of Washington, Seattle, Washington). The consensus sequence 
obtained therefrom is herein designated DNA45627. 
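
The selection step described above (retain comparisons scoring at or above a threshold that do not encode known proteins, then pass them on for assembly) can be sketched as a simple filter. This is an illustration under stated assumptions only: the `Hit` record, its field names and the example identifiers are hypothetical, and the assembly itself (performed with "phrap" in the text) is not reproduced here.

```python
from dataclasses import dataclass


@dataclass
class Hit:
    est_id: str                  # hypothetical EST identifier
    blast_score: float           # score reported by BLAST or BLAST2
    encodes_known_protein: bool  # True if the hit matches a known protein


def select_for_assembly(hits, min_score=70.0):
    """Keep hits at or above the score threshold that do not encode known proteins."""
    return [h for h in hits if h.blast_score >= min_score and not h.encodes_known_protein]


# Made-up example records, for illustration only.
hits = [
    Hit("est_a", 95.0, False),
    Hit("est_b", 64.0, False),   # below threshold
    Hit("est_c", 120.0, True),   # known protein, excluded
    Hit("est_d", 70.0, False),   # exactly at threshold, retained
]

print([h.est_id for h in select_for_assembly(hits)])                 # ['est_a', 'est_d']
print([h.est_id for h in select_for_assembly(hits, min_score=90.0)])  # ['est_a']
```

The `min_score` parameter reflects the alternative threshold of 90 mentioned in the text for some cases.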

Based on the DNA45627 sequence, oligonucleotide probes were generated and used to screen a human 
library prepared as described in paragraph 1 of Example 2 above. The cloning vector was pRK5B (pRK5B is 
a precursor of pRK5D that does not contain the SfiI site; see Holmes et al., Science 253:1278-1280 (1991)), 
and the cDNA size cut was less than 2800 bp. 

PCR primers (forward and two reverse) were synthesized: 
forward PCR primer 5'-ATGCAGGCCAAGTACAGCAGCAC-3' (SEQ ID NO:321); 
reverse PCR primer 1 5'-CATGCTGACGACTTCCTGCAAGC-3' (SEQ ID NO:322); and 
reverse PCR primer 2 5'-CCACACAGTCTCTGCTTCTTGGG-3' (SEQ ID NO:323) 

Additionally, a synthetic oligonucleotide hybridization probe was constructed from the DNA45627 
sequence which had the following nucleotide sequence: 
hybridization probe 

5'-ATGCTGGATGATGATGGCK}ACACCACCATGAGCCTGCATT-3' (SEQ ID NO:324). 

In order to screen several libraries for a source of a full-length clone, DNA from the libraries was 
screened by PCR amplification with the PCR primer pair identified above. A positive library was then used to 
isolate clones encoding the PRO1131 gene using the probe oligonucleotide and one of the PCR primers. 

A full length clone was identified that contained a single open reading frame with an apparent 
translational initiation site at nucleotide positions 144-146, and a stop signal at nucleotide positions 984-986 
(Figure 229; SEQ ID NO:318). The predicted polypeptide precursor is 280 amino acids long, has a calculated 
molecular weight of approximately 31,966 daltons and an estimated pI of approximately 6.26. The 
transmembrane domain sequence is at about 49-74 of SEQ ID NO:319 and the region having sequence identity 
with LDL receptors is about 50-265 of SEQ ID NO:319. PRO1131 contains potential N-linked glycosylation 
sites at amino acid positions 95-98 and 169-172 of SEQ ID NO:319. Clone DNA59777-1480 has been deposited 
with the ATCC and is assigned ATCC deposit no. 203111. 

An analysis of the Dayhoff database (version 35.45 SwissProt 35), using a WU-BLAST2 sequence 
alignment analysis of the full-length sequence shown in Figure 230 (SEQ ID NO:319), evidenced some sequence 
identity between the PRO1131 amino acid sequence and the following Dayhoff sequences: AB010710 1,149053, 
149115, RNU56863J, LY4A_MOUSE, 155686, MMU56404J, 149361, AF030313J and MMU09739J. 

EXAMPLE 102: Isolation of cDNA clones Encoding Human PRO1281 

A consensus DNA sequence was assembled relative to other EST sequences using phrap as described 
in Example 1 above. This consensus sequence is designated herein as DNA35720. Based on the DNA35720 
sequence, oligonucleotides were synthesized: 1) to identify by PCR a cDNA library that contained the sequence 
of interest, and 2) for use as probes to isolate a clone of the full-length coding sequence for PRO1281. 

PCR primers (forward and reverse) were synthesized: 
forward PCR primers: 
5'-TGGAAGGCTGCCGCAACGACAATC-3' (SEQ ID NO:327); 
5'<HtiATGTGGCCGATGTTCTG-3' (SEQ ID NO:328); and 

5'-ATGGGTCAGTGTGCAGACAG-3' (SEQ ID NO:329). 

reverse PCR primers: 

5'-GCATGCTGCTCCGTGAAGTAGTCC-3' (SEQ ID NO:330); and 
5'-ATGCATGGGAAAGAAGGCCTGCCC-3' (SEQ ID NO:331). 

Additionally, a synthetic oligonucleotide hybridization probe was constructed from the DNA35720 sequence 
which had the following nucleotide sequence: 
hybridization probe: 

5'-TGCACrGGTGACCACGAGGGGGTGCACTATAGCCATCTGGAGCTGAG-3' (SEQ ID NO:332). 

In order to screen several libraries for a source of a full-length clone, DNA from the libraries was 
screened by PCR amplification with the PCR primer pairs identified above. A positive library was then used 
to isolate clones encoding the PRO1281 gene using the probe oligonucleotide and one of the PCR primers. RNA 
for construction of the cDNA libraries was isolated from human fetal liver. 

DNA sequencing of the clones isolated as described above gave the full-length DNA sequence for 
PRO1281 (designated herein as DNA59820-1549 [Figure 232, SEQ ID NO:325]) and the derived protein 
sequence for PRO1281. 

The entire coding sequence of PRO1281 is shown in Figure 232 (SEQ ID NO:325). Clone DNA59820-1549 
contains a single open reading frame with an apparent translational initiation site at nucleotide positions 
228-230 and an apparent stop codon at nucleotide positions 2553-2555. The predicted polypeptide precursor 
is 775 amino acids long. The full-length PRO1281 protein shown in Figure 233 has an estimated molecular 
weight of about 85,481 daltons and a pI of about 6.92. Additional features include a signal peptide at about 
amino acids 1-15; and potential N-glycosylation sites at about amino acids 138-141 and 361-364. 

An analysis of the Dayhoff database (version 35.45 SwissProt 35), using a WU-BLAST2 sequence 

CEC38H2J, CAC2_HAECO, B3A2_HUMAN, S22373, CEF38A3.2, CEC34F62, CEC34F6.3, and 
CELT22 ^^ 3. Clone DNA59820-1549 has been deposited with ATCC and is assigned ATCC deposit no. 203129. 

EXAMPLE 103: Isolation of cDNA clones Encoding Human PRO1064 

A cDNA sequence isolated in the amylase screen described in Example 2 above was found, by the 
WU-BLAST2 sequence alignment computer program, to have no significant sequence identity to any known human 
protein. This cDNA sequence is herein designated DNA45288. The DNA45288 sequence was then compared 
to various EST databases including public EST databases (e.g., GenBank), and a proprietary EST database 
(LIFESEQ™, Incyte Pharmaceuticals, Palo Alto, CA) to identify homologous EST sequences. The comparison 
was performed using the computer program BLAST or BLAST2 [Altschul et al., Methods in Enzymology 
266:460-480 (1996)]. Those comparisons resulting in a BLAST score of 70 (or in some cases, 90) or greater 
that did not encode known proteins were clustered and assembled into a consensus DNA sequence with the 
program "phrap" (Phil Green, University of Washington, Seattle, Washington). This consensus sequence 
is herein designated DNA48609. Oligonucleotide primers based upon the DNA48609 sequence were then 
synthesized and employed to screen a human fetal kidney cDNA library which resulted in the identification of 
the DNA59827-1426 clone shown in Figure 234. The cloning vector was pRK5B (pRK5B is a precursor of 
pRK5D that does not contain the SfiI site; see Holmes et al., Science 253:1278-1280 (1991)), and the cDNA 
size cut was less than 2800 bp. 

The oligonucleotide probes employed were as follows: 
forward PCR primer 5'-CTGAGACCCTGCAGCACCATCTG-3' (SEQ ID NO:336) 
reverse PCR primer 5'-GGTGCTTCTTGAGCCCCACTTAGC-3' (SEQ ID NO:337) 

Additionally, a synthetic oligonucleotide hybridization probe was constructed from the consensus 
DNA48609 sequence which had the following nucleotide sequence: 
hybridization probe 

5'-AATCTAGCTTCTCCAGGACTGTGGTCGCCCCGTCCGCTGT-3' (SEQ ID NO:338) 

A full length clone was identified that contained a single open reading frame with an apparent 
translational initiation site at nucleotide positions 532-534 and a stop signal at nucleotide positions 991-993 
(Figure 234, SEQ ID NO:333). The predicted polypeptide precursor is 153 amino acids long, has a calculated 
molecular weight of approximately 17,317 daltons and an estimated pI of approximately 5.17. Analysis of the 
full-length PRO1064 sequence shown in Figure 235 (SEQ ID NO:334) evidences the presence of the following: 
a signal peptide from about amino acid 1 to about amino acid 24, a transmembrane domain from about amino 
acid 89 to about amino acid 110, an indole-3-glycerol phosphate synthase homology block from about amino acid 
74 to about amino acid 105 and a Myb DNA binding domain protein repeat protein homology block from about 
amino acid 114 to about amino acid 137. Clone DNA59827-1426 has been deposited with ATCC on August 4, 
1998 and is assigned ATCC deposit no. 203089. 
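
The feature coordinates listed above can be checked for internal consistency, for instance that every feature lies within the 153-residue precursor and which features overlap one another. The following is a minimal sketch for illustration; the dictionary layout and the function names `out_of_range` and `overlapping_pairs` are hypothetical, and the positions are the approximate ("about") values stated in the paragraph above.

```python
PROTEIN_LENGTH = 153  # predicted precursor length stated in the text

# Approximate 1-based, inclusive amino acid ranges as stated in the text.
features = {
    "signal peptide": (1, 24),
    "transmembrane domain": (89, 110),
    "indole-3-glycerol phosphate synthase homology block": (74, 105),
    "Myb DNA binding domain repeat homology block": (114, 137),
}


def out_of_range(feats, length):
    """Return names of features that do not fit within 1..length."""
    return [name for name, (start, end) in feats.items()
            if not (1 <= start <= end <= length)]


def overlapping_pairs(feats):
    """Return pairs of feature names whose inclusive ranges share a residue."""
    names = list(feats)
    pairs = []
    for i, first in enumerate(names):
        for second in names[i + 1:]:
            (s1, e1), (s2, e2) = feats[first], feats[second]
            if s1 <= e2 and s2 <= e1:
                pairs.append((first, second))
    return pairs


print(out_of_range(features, PROTEIN_LENGTH))
print(overlapping_pairs(features))
```

With the values as stated, no feature falls outside the precursor, and the only overlap is between the transmembrane domain and the indole-3-glycerol phosphate synthase homology block.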

An analysis of the Dayhoff database (version 35.45 SwissProt 35), using a WU-BLAST2 sequence 
alignment analysis of the full-length sequence shown in Figure 235 (SEQ ID NO:334), evidenced homology 
between the PRO1064 amino acid sequence and the following Dayhoff sequences: MMNP15PRO 1, 
BP187PLYHJ, CEIJ*2G8j^MMU58888J,GEN14270 t ^ 
SESENODAJ and A21467J. 

EXAMPLE 104: Isolation of cDNA clones Encoding Human PRO1379 

A consensus DNA sequence was assembled relative to other EST sequences using phrap as described 
in Example 1 above. This consensus sequence is designated herein as DNA45232. Based on the DNA45232 
consensus sequence, oligonucleotides were synthesized: 1) to identify by PCR a cDNA library that contained 
the sequence of interest, and 2) for use as probes to isolate a clone of the full-length coding sequence for 
PRO1379. 

PCR primers (forward and reverse) were synthesized: 
forward PCR primer 5'-TGGACACCGTACCCTGGTATCTGC-3' (SEQ ID NO:341) 

reverse PCR primer S'-CCAACTfTOAfinAfiAQT A Affmnr-v <SEQ ID NO:342) 
Additionally, a synthetic oligonucleotide hybridization probe was constructed from the 
DNA45232 sequence which had the following nucleotide sequence: 

hybridization probe 

5'-TGTATGTGCACACCCTCACCATCACCTCCAAGGGCAAGGAGAAC-3' (SEQ ID NO:343). 

In order to screen several libraries for a source of a full-length clone, DNA from the libraries was 
screened by PCR amplification with the PCR primer pair identified above. A positive library was then used to 
isolate clones encoding the PRO1379 gene using the probe oligonucleotide and one of the PCR primers. RNA 
for construction of the cDNA libraries was isolated from human fetal kidney tissue. 

DNA sequencing of the clones isolated as described above gave the full-length DNA sequence for 
PRO1379 which is designated herein as DNA59828-1608 and shown in Figure 237 (SEQ ID NO:339); and the 
derived protein sequence for PRO1379 (SEQ ID NO:340). 

The entire coding sequence of PRO1379 is shown in Figure 237 (SEQ ID NO:339). Clone DNA59828-1608 
contains a single open reading frame with an apparent translational initiation site at nucleotide positions 
10-12 and an apparent stop codon at nucleotide positions 1732-1734. The predicted polypeptide precursor is 574 
amino acids long. The full-length PRO1379 protein shown in Figure 238 has an estimated molecular weight of 
about 65,355 daltons and a pI of about 8.73. Additional features include a signal peptide at about amino acids 
1-17 and potential N-glycosylation sites at about amino acids 160-163, 287-290, and 323-326. 

An analysis of the Dayhoff database (version 35.45 SwissProt 35), using a WU-BLAST2 sequence 
alignment analysis of the full-length sequence shown in Figure 238 (SEQ ID NO:340), revealed some homology 
between the PRO1379 amino acid sequence and the following Dayhoff sequences: YHY8YEAST, AF040625_1, 
HP714394_1, and HTV18U45630_l. 

Clone DNA59828-1608 has been deposited with ATCC and is assigned ATCC deposit no. 203158. 

EXAMPLE 105: Isolation of cDNA Clones Encoding Human PRO844 

An expressed sequence tag (EST) DNA database (LIFESEQ™, Incyte Pharmaceuticals, Palo Alto, CA) 
was searched and an EST was identified which showed sequence identity with aLP. Based on the information 
and discoveries provided herein, the clone for this EST, Incyte clone no. 2657496 from a cancerous lung library, 
was further examined. 

DNA sequencing of the insert for this clone gave a sequence (herein designated as DNA59838-1462; 
SEQ ID NO:344) which includes the full-length DNA sequence for PRO844 and the derived protein sequence 
for PRO844. 

The entire nucleotide sequence of DNA59838-1462 is shown in Figure 239 (SEQ ID NO:344). Clone 
DNA59838-1462 contains a single open reading frame with an apparent translational initiation site at nucleotide 
positions 5-7 and ending at the stop codon at nucleotide positions 338-340 of SEQ ID NO:344 (Figure 239). 
The predicted polypeptide precursor is 111 amino acids long (Figure 240). The full-length PRO844 protein 
shown in Figure 240 has an estimated molecular weight of about 12,050 daltons and a pI of about 5.45. Clone 
UNQ544 DNA59838-1462 has been deposited with ATCC on June 16, 1998. It is understood that the deposited 
clone has the actual nucleic acid sequence and that the sequences provided herein are based on known 
sequencing techniques. 

Analysis of the amino acid sequence of the full-length PRO844 polypeptide suggests that it possesses 
significant sequence similarity to serine protease inhibitors, thereby indicating that PRO844 may be a novel 
proteinase inhibitor. More specifically, an analysis of the Dayhoff database (version 35.45 SwissProt 35) 
evidenced significant homology between the PRO844 amino acid sequence and at least the following Dayhoff 
sequences: ALK1HUMAN, P_P82403, P_P82402, ELAFHUMAN and P_P60950. 

EXAMPLE 106: Isolation of cDNA Clones Encoding Human PRO848 

Use of the signal sequence algorithm described in Example 3 above allowed identification of a single 
EST cluster sequence from the Incyte database. This EST cluster sequence was then compared to a variety of 
expressed sequence tag (EST) databases which included public EST databases (e.g., GenBank) and a proprietary 
EST DNA database (LIFESEQ™, Incyte Pharmaceuticals, Palo Alto, CA) to identify existing homologies. The 
homology search was performed using the computer program BLAST or BLAST2 (Altschul et al., Methods in 
Enzymology 266:460-480 (1996)). Those comparisons resulting in a BLAST score of 70 (or in some cases 90) 
or greater that did not encode known proteins were clustered and assembled into a consensus DNA sequence with 
the program "phrap" (Phil Green, University of Washington, Seattle, Washington). The consensus sequence 
obtained therefrom is herein designated DNA55999. 

In light of an observed sequence homology between the DNA55999 consensus sequence and an EST 
sequence encompassed within the Incyte EST clone no. 2768571, the Incyte EST clone 2768571 was purchased 
and the cDNA insert was obtained and sequenced. It was found that this insert encoded a full-length protein. 
The sequence of this cDNA insert is shown in Figure 241 and is herein designated as DNA59839-1461. 

The entire nucleotide sequence of DNA59839-1461 is shown in Figure 241 (SEQ ID NO:346). Clone 
DNA59839-1461 contains a single open reading frame with an apparent translational initiation site at nucleotide 
positions 146-148 and ending at the stop codon at nucleotide positions 1946-1948 of SEQ ID NO:346 (Figure 
241). The predicted polypeptide precursor is 600 amino acids long (Figure 242). The full-length PRO848 
protein shown in Figure 242 has an estimated molecular weight of about 68,536 daltons. Clone DNA59839-1461 
has been deposited with ATCC on June 16, 1998. It is understood that the deposited clone has the actual nucleic 
acid sequence and that the sequences provided herein are based on known sequencing techniques. 

Analysis of the amino acid sequence of the full-length PRO848 polypeptide suggests that it may be a 
novel sialyltransferase. More specifically, an analysis of the Dayhoff database (version 35.45 SwissProt 35) 
evidenced sequence identity between the PRO848 amino acid sequence and at least the following Dayhoff 
sequences: P_R78619 (GalNAc-alpha-2,6-sialyltransferase), CAAG5_CHICK (alphani-acetylgalactosamide 
alpha-2,6-sialytransfe^ 3-sialyltransferase). 

EXAMPLE 107: Isolation of cDNA Clones Encoding Human PRO1097 

Use of the signal sequence algorithm described in Example 3 above allowed identification of a single 
EST cluster sequence from the Incyte database. This EST cluster sequence was then compared to a variety of 
expressed sequence tag (EST) databases which included public EST databases (e.g., GenBank) and a proprietary 
EST DNA database (LIFESEQ™, Incyte Pharmaceuticals, Palo Alto, CA) to identify existing homologies. The 
homology search was performed using the computer program BLAST or BLAST2 (Altschul et al., Methods in 
Enzymology 266:460-480 (1996)). Those comparisons resulting in a BLAST score of 70 (or in some cases 90) 
or greater that did not encode known proteins were clustered and assembled into a consensus DNA sequence with 
the program "phrap" (Phil Green, University of Washington, Seattle, Washington). The consensus sequence 
obtained therefrom is herein designated DNA56006. 

In light of an observed sequence homology between the DNA56006 consensus sequence and an EST 
sequence encompassed within the Incyte EST clone no. 2408105, the Incyte EST clone 2408105 was purchased 
and the cDNA insert was obtained and sequenced. It was found that this insert encoded a full-length protein. 
The sequence of this cDNA insert is shown in Figure 243 and is herein designated as DNA59841-1460. 

The entire nucleotide sequence of DNA59841-1460 is shown in Figure 243 (SEQ ID NO:348). Clone 
DNA59841-1460 contains a single open reading frame with an apparent translational initiation site at nucleotide 
positions 3-5 and ending at the stop codon at nucleotide positions 276-278 of SEQ ID NO:348 (Figure 243). 
The predicted polypeptide precursor is 91 amino acids long (Figure 244). The full-length PRO1097 protein 
shown in Figure 244 has an estimated molecular weight of about 10,542 daltons and a pI of about 10.04. Clone 
DNA59841-1460 has been deposited with ATCC on July 1, 1998. It is understood that the deposited clone has 
the actual nucleic acid sequence and that the sequences provided herein are based on known sequencing 
techniques. 

Analyzing Figure 244, the signal peptide is at about amino acids 1-20 of SEQ ID NO:349. The 
glycoprotease family protein domain starts at about amino acid 56, and the acyltransferase ChoActase/COT/CPT 
family peptide starts at about amino acid 49 of SEQ ID NO:349. 

EXAMPLE 108: Isolation of cDNA clones Encoding Human PRO1153 

Use of the signal sequence algorithm described in Example 3 above allowed identification of a single 
EST cluster sequence from the Incyte database. This EST cluster sequence was then compared to a variety of 
expressed sequence tag (EST) databases which included public EST databases (e.g., GenBank) and a proprietary 
EST DNA database (LIFESEQ™, Incyte Pharmaceuticals, Palo Alto, CA) to identify existing homologies. The 
homology search was performed using the computer program BLAST or BLAST2 (Altschul et al., Methods in 
Enzymology 266:460-480 (1996)). Those comparisons resulting in a BLAST score of 70 (or in some cases 90) 
or greater that did not encode known proteins were clustered and assembled into a consensus DNA sequence with 
the program "phrap" (Phil Green, University of Washington, Seattle, Washington). The consensus sequence 
obtained therefrom is herein designated DNA56008. 

In light of an observed sequence homology between the DNA56008 consensus sequence and an EST 
sequence encompassed within the Incyte EST clone no. 2472409, the Incyte EST clone 2472409 was purchased 
and the cDNA insert was obtained and sequenced. It was found that this insert encoded a full-length protein. 
The sequence of this cDNA insert is shown in Figure 245 and is herein designated as DNA59842-1502. 

The full length clone shown in Figure 245 contained a single open reading frame with an apparent 
translational initiation site at nucleotide positions 92-94 and ending at the stop codon found at nucleotide positions 
683-685 (Figure 245; SEQ ID NO:350). The predicted polypeptide precursor (Figure 246, SEQ ID NO:351) 
is 197 amino acids long. PRO1153 has a calculated molecular weight of approximately 21,540 daltons and an 
estimated pI of approximately 8.31. Clone DNA59842-1502 has been deposited with ATCC and is assigned 
ATCC deposit no. 209982. It is understood that the correct and actual sequence is in the deposited clone, while 
the sequences presented herein are representations based on current sequencing techniques, which may have 
minor errors. 

Based on a WU-BLAST2 sequence alignment analysis (using the ALIGN computer program) of the 
full-length sequence, PRO1153 shows some amino acid sequence identity to the following Dayhoff designations: 
S57447; SOYHRGPCJ; S46965; PJ>82971; VCPHEROPHJ; EXTNTOBAC; MLCB2548_9; 
ANXAJRABIT; JC5437 and SSGP_VOLCA. 

EXAMPLE 109: Isolation of cDNA clones Encoding Human PRO1154 

Use of the signal sequence algorithm described in Example 3 above allowed identification of a single 
EST cluster sequence from the Incyte database. This EST cluster sequence was then compared to a variety of 
expressed sequence tag (EST) databases which included public EST databases (e.g., GenBank) and a proprietary 
EST DNA database (LIFESEQ™, Incyte Pharmaceuticals, Palo Alto, CA) to identify existing homologies. The 
homology search was performed using the computer program BLAST or BLAST2 (Altschul et al., Methods in 
Enzymology 266:460-480 (1996)). Those comparisons resulting in a BLAST score of 70 (or in some cases 90) 
or greater that did not encode known proteins were clustered and assembled into a consensus DNA sequence with 
the program "phrap" (Phil Green, University of Washington, Seattle, Washington). The consensus sequence 
obtained therefrom is herein designated DNA56025. 

In light of an observed sequence homology between the DNA56025 consensus sequence and an EST 
sequence encompassed within the Incyte EST clone no. 2169375, the Incyte EST clone 2169375 was purchased 
and the cDNA insert was obtained and sequenced. It was found that this insert encoded a full-length protein. 
The sequence of this cDNA insert is shown in Figure 247 and is herein designated as DNA59846-1503. 

The full length clone shown in Figure 247 contained a single open reading frame with an apparent 
translational initiation site at nucleotide positions 86-88 and ending at the stop codon found at nucleotide positions 
2909-2911 (Figure 247; SEQ ID NO:352). The predicted polypeptide precursor (Figure 248, SEQ ID NO:353) 
is 941 amino acids long. PRO1154 has a calculated molecular weight of approximately 107,144 daltons and an 
estimated pI of approximately 6.26. Clone DNA59846-1503 has been deposited with ATCC and is assigned 
ATCC deposit no. 209978. 

Based on a WU-BLAST2 sequence alignment analysis (using the ALIGN computer program) of the 
full-length sequence, PRO1154 shows sequence identity to at least the following Dayhoff designations: AB011097_1, 
AMPN_HUMAN, RNU76997J, 159331, GEN 14047, HSU62768J, PJW1281, CET07F10J, SSU66371J, 
and AMPRE_HUMAN. 

EXAMPLE 110: Isolation of cDNA clones Encoding Human PRO1181 

Use of the signal sequence algorithm described in Example 3 above allowed identification of a single 
EST cluster sequence from the Incyte database, designated herein as 82468. This EST cluster sequence was then 
compared to a variety of expressed sequence tag (EST) databases which included public EST databases (e.g., 
GenBank) and a proprietary EST DNA database (LIFESEQ™, Incyte Pharmaceuticals, Palo Alto, CA) to identify 
existing homologies. The homology search was performed using the computer program BLAST or BLAST2 
(Altschul et al., Methods in Enzymology 266:460-480 (1996)). Those comparisons resulting in a BLAST score 
of 70 (or in some cases 90) or greater that did not encode known proteins were clustered and assembled into a 
consensus DNA sequence with the program "phrap" (Phil Green, University of Washington, Seattle, 
Washington). The consensus sequence obtained therefrom is herein designated DNA56029. 

In light of an observed sequence homology between the DNA56029 consensus sequence and an EST 
sequence encompassed within the Incyte EST clone no. 2186536, the Incyte EST clone 2186536 was purchased 
and the cDNA insert was obtained and sequenced. It was found that this insert encoded a full-length protein. 
The sequence of this cDNA insert is shown in Figure 249 and is herein designated as DNA59847-1511. 

Clone DNA59847-1511 contains a single open reading frame with an apparent translational initiation 
site at nucleotide positions 17-19 and ending at the stop codon at nucleotide positions 1328-1330 (Figure 249). 
The predicted polypeptide precursor is 437 amino acids long (Figure 250). The full-length PRO1181 protein 
shown in Figure 250 has an estimated molecular weight of about 46,363 daltons and a pI of about 6.22. Analysis 
of the full-length PRO1181 sequence shown in Figure 250 (SEQ ID NO:355) evidences the presence of the 
following: a signal peptide from about amino acid 1 to about amino acid 15, potential N-glycosylation sites from 
about amino acid 46 to about amino acid 49, from about amino acid 189 to about amino acid 192 and from about 
amino acid 382 to about amino acid 385 and amino acid sequence blocks having homology to Ly-6/u-PAR 
domain proteins from about amino acid 287 to about amino acid 300 and from about amino acid 98 to about 
amino acid 111. Clone DNA59847-1511 has been deposited with ATCC on August 4, 1998 and is assigned 
ATCC deposit no. 203098. 

An analysis of the Dayhoff database (version 35.45 SwissProt 35), using a WU-BLAST2 sequence 
alignment analysis of the full-length sequence shown in Figure 250 (SEQ ID NO:355), evidenced homology 
between the PRO1181 amino acid sequence and the following Dayhoff sequences: AP041083J, P_W26579, 
RNMAGPIAN_1, CELT13C2_2, LMSAP2GNJ, S61882, CEF35C5J2, DP87DICDI, G1U47631.1 and 
P_R07092. 

EXAMPLE 111: Isolation of cDNA clones Encoding Human PRO1182 

Use of the signal sequence algorithm described in Example 3 above allowed identification of a single 
EST cluster sequence from the Incyte database, designated herein as 146647. This EST cluster sequence was 
then compared to a variety of expressed sequence tag (EST) databases which included public EST databases 
(e.g., GenBank) and a proprietary EST DNA database (LIFESEQ™, Incyte Pharmaceuticals, Palo Alto, CA) to 
identify existing homologies. The homology search was performed using the computer program BLAST or 
BLAST2 (Altschul et al., Methods in Enzymology 266:460-480 (1996)). Those comparisons resulting in a 
BLAST score of 70 (or in some cases 90) or greater that did not encode known proteins were clustered and 
assembled into a consensus DNA sequence with the program "phrap" (Phil Green, University of Washington, 
Seattle, Washington). The consensus sequence obtained therefrom is herein designated DNA56033. 

In light of an observed sequence homology between the DNA56033 consensus sequence and an EST 
sequence encompassed within the Incyte EST clone no. 2595195, the Incyte EST clone 2595195 was purchased 
and the cDNA insert was obtained and sequenced. It was found that this insert encoded a full-length protein. 
The sequence of this cDNA insert is shown in Figure 251 and is herein designated as DNA59848-1512. 

Clone DNA59848-1512 contains a single open reading frame with an apparent translational initiation 
site at nucleotide positions 67-69 and ending at the stop codon at nucleotide positions 880-882 (Figure 251). The 
predicted polypeptide precursor is 271 amino acids long (Figure 252). The full-length PRO1182 protein shown 
in Figure 252 has an estimated molecular weight of about 28,665 daltons and a pI of about 5.33. Analysis of 
the full-length PRO1182 sequence shown in Figure 252 (SEQ ID NO:357) evidences the presence of the 
following: a signal peptide from about amino acid 1 to about amino acid 25, an amino acid block having 
homology to C-type lectin domain proteins from about amino acid 247 to about amino acid 256 and an amino 
acid sequence block having homology to C1q domain proteins from about amino acid 44 to about amino acid 
77. Clone DNA59848-1512 has been deposited with ATCC on August 4, 1998 and is assigned ATCC deposit 
no. 203088. 

An analysis of the Dayhoff database (version 35.45 SwissProt 35), using a WU-BLAST2 sequence 
alignment analysis of the full-length sequence shown in Figure 252 (SEQ ID NO:357), evidenced significant 
homology between the PRO1182 amino acid sequence and the following Dayhoff sequences: PSPD_BOVIN, 
CL43_BOVIN, CONG_BOVIN, P_W18780, P_R450Q5, P_R53257 and CELEGAP7J. 

EXAMPLE 112: Isolation of cDNA clones Encoding Human PRO1155

Use of the signal sequence algorithm described in Example 3 above allowed identification of a single
EST cluster sequence from the Incyte database. This EST cluster sequence was then compared to a variety of
expressed sequence tag (EST) databases which included public EST databases (e.g., GenBank) and a proprietary
EST DNA database (LIFESEQ®, Incyte Pharmaceuticals, Palo Alto, CA) to identify existing homologies. The
homology search was performed using the computer program BLAST or BLAST2 (Altschul et al., Methods in
Enzymology 266:460-480 (1996)). Those comparisons resulting in a BLAST score of 70 (or in some cases 90)
or greater that did not encode known proteins were clustered and assembled into a consensus DNA sequence with
the program "phrap" (Phil Green, University of Washington, Seattle, Washington). The consensus sequence
obtained therefrom is herein designated DNA56102.
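
The selection criterion described above (a BLAST score of 70 or greater, or 90 in some cases, with hits that encode known proteins excluded) may be illustrated by a simple filter applied before assembly. The sketch below is illustrative only: the `Hit` record, its field names and the example identifiers are hypothetical, and the actual clustering and assembly were performed with "phrap", which is not reproduced here.

```python
from dataclasses import dataclass


@dataclass
class Hit:
    est_id: str                   # hypothetical EST identifier
    score: int                    # BLAST score of the comparison
    encodes_known_protein: bool   # True if the hit matches a known protein


def select_for_assembly(hits, min_score=70):
    """Keep ESTs scoring at or above the threshold that do not encode known proteins."""
    return [h.est_id for h in hits
            if h.score >= min_score and not h.encodes_known_protein]


hits = [
    Hit("est_a", 95, False),
    Hit("est_b", 64, False),   # below the threshold of 70
    Hit("est_c", 120, True),   # excluded: encodes a known protein
    Hit("est_d", 70, False),   # retained: the threshold is inclusive
]

print(select_for_assembly(hits))               # default threshold of 70
print(select_for_assembly(hits, min_score=90)) # stricter threshold used in some cases
```

Raising `min_score` to 90 corresponds to the stricter threshold that the text indicates was used in some cases.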

In light of an observed sequence homology between the DNA56102 consensus sequence and an EST
sequence encompassed within the Incyte EST clone no. 2858870, the Incyte EST clone 2858870 was purchased
and the cDNA insert was obtained and sequenced. It was found that this insert encoded a full-length protein.
The sequence of this cDNA insert is shown in Figure 253 and is herein designated as DNA59849-1504.

The full length clone shown in Figure 253 contained a single open reading frame with an apparent 
translational initiation site at nucleotide positions 158-160 and ending at the stop codon found at nucleotide 
positions 563-565 (Figure 253; SEQ ID NO:358). The predicted polypeptide precursor (Figure 254, SEQ ID
NO:359) is 135 amino acids long. PRO1155 has a calculated molecular weight of approximately 14,833 daltons
and an estimated pI of approximately 9.78. Clone DNA59849-1504 has been deposited with ATCC and is
assigned ATCC deposit no. 2099[illegible]. It is understood that the deposited clone has the actual sequence and that the sequences provided
herein are only representations which are prone to minor sequencing errors.

Based on a WU-BLAST2 sequence alignment analysis (using the ALIGN computer program) of the
full-length sequence, PRO1155 shows some amino acid sequence identity with the following Dayhoff designations:
TKNK_BOVIN; PVB19X587_1; AF019049_1; P_W00948; S72864; P_W00949; 162742; AF038501_1;
TKNG_HUMAN; and YAT1_RHOBL. Based on the information provided herein, PRO1155 may play a role
in providing neuroprotection and cognitive enhancement.

EXAMPLE 113: Isolation of cDNA clones Encoding Human PRO1156

Use of the signal sequence algorithm described in Example 3 above allowed identification of a single
EST cluster sequence from the Incyte database, designated herein as 138851. This EST cluster sequence was
then compared to a variety of expressed sequence tag (EST) databases which included public EST databases
(e.g., GenBank) and a proprietary EST DNA database (LIFESEQ®, Incyte Pharmaceuticals, Palo Alto, CA) to
identify existing homologies. The homology search was performed using the computer program BLAST or
BLAST2 (Altschul et al., Methods in Enzymology 266:460-480 (1996)). Those comparisons resulting in a
BLAST score of 70 (or in some cases 90) or greater that did not encode known proteins were clustered and
assembled into a consensus DNA sequence with the program "phrap" (Phil Green, University of Washington,
Seattle, Washington). The consensus sequence obtained therefrom is herein designated DNA56261.

In light of an observed sequence homology between the DNA56261 consensus sequence and an EST 
sequence encompassed within the Incyte EST clone no. 3675191, the Incyte EST clone 3675191 was purchased 
and the cDNA insert was obtained and sequenced. It was found that this insert encoded a full-length protein. 
The sequence of this cDNA insert is shown in Figure 255 and is herein designated as DNA59853-1505.

The full length clone shown in Figure 255 contained a single open reading frame with an apparent 
translational initiation site at nucleotide positions 212-214 and ending at the stop codon found at nucleotide 
positions 689-691 (Figure 255; SEQ ID NO:360). The predicted polypeptide precursor (Figure 256, SEQ ID
NO:361) is 159 amino acids long. PRO1156 has a calculated molecular weight of approximately 17,476 daltons,
an estimated pI of approximately 9.15, a signal peptide sequence at about amino acids 1 to about 22, and
potential N-glycosylation sites at about amino acids 27-30 and 41-44.
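
The four-residue ranges reported for potential N-glycosylation sites are consistent with the conventional N-linked glycosylation sequon, N-{P}-[ST]-{P} (asparagine, any residue except proline, serine or threonine, any residue except proline). The text does not state which program was used to locate these sites, so the following is only an illustrative sketch of such a scan; the function name and the toy sequence are hypothetical and are not taken from the figures.

```python
import re

# Sequon N-{P}-[ST]-{P}; the lookahead allows overlapping matches to be reported.
SEQUON = re.compile(r"(?=(N[^P][ST][^P]))")


def find_n_glyc_sites(protein):
    """Return 1-based (start, end) residue ranges for each potential sequon."""
    return [(m.start() + 1, m.start() + 4) for m in SEQUON.finditer(protein)]


# Hypothetical toy sequence: N3 and N13 begin valid sequons, whereas N9 is
# followed by proline and is therefore not reported.
toy = "MKNASAPLNPTQNGTL"
print(find_n_glyc_sites(toy))
```

Each reported range spans four residues, which matches the form of the ranges given above (for example, 27-30 and 41-44).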

Clone DNA59853-1505 was deposited with the ATCC on June 16, 1998 and is assigned ATCC deposit
no. 209985.

An analysis of the Dayhoff database (version 35.45 SwissProt 35), using a WU-BLAST2 sequence
alignment analysis (using the ALIGN computer program) of the full-length sequence shown in Figure 256 (SEQ
ID NO:361), revealed some homology between the PRO1156 amino acid sequence and the following Dayhoff
sequences: D45027_1, P_R79914, JC5309, KBF2_HUMAN, AF010144_1, GEN14351, S68681, P_R79915,
ZMTAC_3, and HUMCPGO_1.

EXAMPLE 114: Isolation of cDNA clones Encoding Human PRO1098

Use of the signal sequence algorithm described in Example 3 above allowed identification of a single 
EST cluster sequence from the Incyte database. This EST cluster sequence was then compared to a variety of 
expressed sequence tag (EST) databases which included public EST databases (e.g., GenBank) and a proprietary
EST DNA database (LIFESEQ®, Incyte Pharmaceuticals, Palo Alto, CA) to identify existing homologies. The
homology search was performed using the computer program BLAST or BLAST2 (Altschul et al., Methods in
Enzymology 266:460-480 (1996)). Those comparisons resulting in a BLAST score of 70 (or in some cases 90)
or greater that did not encode known proteins were clustered and assembled into a consensus DNA sequence with
the program "phrap" (Phil Green, University of Washington, Seattle, Washington). The consensus sequence
obtained therefrom is herein designated DNA56377.

In light of an observed sequence homology between the DNA56377 consensus sequence and an EST
sequence encompassed within the Incyte EST clone no. 3050917, the Incyte EST clone 3050917 was purchased
and the cDNA insert was obtained and sequenced. It was found that this insert encoded a full-length protein.
The sequence of this cDNA insert is shown in Figure 257 and is herein designated as DNA59854-1459.

The entire nucleotide sequence of DNA59854-1459 is shown in Figure 257 (SEQ ID NO:362). Clone
DNA59854-1459 contains a single open reading frame with an apparent translational initiation site at nucleotide
positions 58-60 and ending at the stop codon at nucleotide positions 292-294 of SEQ ID NO:362 (Figure 257).
The predicted polypeptide precursor is 78 amino acids long (Figure 258). The full-length PRO1098 protein
shown in Figure 258 has an estimated molecular weight of about 8,396 daltons and a pI of about 7.66. Clone
DNA59854-1459 has been deposited with ATCC on June 16, 1998. It is understood that the deposited clone
has the actual nucleic acid sequence and that the sequences provided herein are based on known sequencing
techniques.

Analyzing Figure 258, a signal peptide appears to be at about amino acids 1-19 of SEQ ID NO:363,
an N-glycosylation site appears to be at about amino acids 37-40 of SEQ ID NO:363, and N-myristoylation sites
appear to be at about amino acids 15-20, 19-24 and 60-65 of SEQ ID NO:363.

EXAMPLE 115: Isolation of cDNA clones Encoding Human PRO1127

Use of the signal sequence algorithm described in Example 3 above allowed identification of a single
EST cluster sequence from the Incyte database. This EST cluster sequence was then compared to a variety of
expressed sequence tag (EST) databases which included public EST databases (e.g., GenBank) and a proprietary
EST DNA database (LIFESEQ®, Incyte Pharmaceuticals, Palo Alto, CA) to identify existing homologies. The
homology search was performed using the computer program BLAST or BLAST2 (Altschul et al., Methods in
Enzymology 266:460-480 (1996)). Those comparisons resulting in a BLAST score of 70 (or in some cases 90)
or greater that did not encode known proteins were clustered and assembled into a consensus DNA sequence with
the program "phrap" (Phil Green, University of Washington, Seattle, Washington). The consensus sequence
obtained therefrom is herein designated DNA57959.

In light of an observed sequence homology between the DNA57959 consensus sequence and an EST
sequence encompassed within the Merck EST clone no. 685126, the Merck EST clone 685126 was purchased
and the cDNA insert was obtained and sequenced. It was found that this insert encoded a full-length protein.
The sequence of this cDNA insert is shown in Figure 259 and is herein designated as DNA60283-1484.

The full length clone shown in Figure 259 contained a single open reading frame with an apparent
translational initiation site at nucleotide positions 126-128 and ending at the stop codon found at nucleotide
positions 327-329 (Figure 259; SEQ ID NO:364). The predicted polypeptide precursor (Figure 260, SEQ ID
NO:365) is 67 amino acids long including a signal peptide at about 1-29 of SEQ ID NO:365. PRO1127 has a
calculated molecular weight of approximately 7,528 daltons and an estimated pI of approximately 4.95. Clone
DNA60283-1484 was deposited with the ATCC on July 1, 1998 and is assigned ATCC deposit no. 203043.
It is understood that the deposited clone has the actual sequence, whereas representations which may have minor
sequencing errors are presented herein.

An analysis of the Dayhoff database (version 35.45 SwissProt 35), using a WU-BLAST2 sequence
alignment analysis of the full-length sequence shown in Figure 260 (SEQ ID NO:365), revealed some homology
between the PRO1127 amino acid sequence and the following Dayhoff sequences: AF037218_48, P_W09638,
HBA_HETPO, S39821, KR2_EBV, CET20D3_1, HCU37630_1, HS193B12_10, S40012 and TRITUBC_1.

EXAMPLE 116: Isolation of cDNA clones Encoding Human PRO1126

Use of the signal sequence algorithm described in Example 3 above allowed identification of a single 
EST cluster sequence from the Incyte database. This EST cluster sequence was then compared to a variety of
expressed sequence tag (EST) databases which included public EST databases (e.g., GenBank) and a proprietary
EST DNA database (LIFESEQ®, Incyte Pharmaceuticals, Palo Alto, CA) to identify existing homologies. The
homology search was performed using the computer program BLAST or BLAST2 (Altschul et al., Methods in
Enzymology 266:460-480 (1996)). Those comparisons resulting in a BLAST score of 70 (or in some cases 90)
or greater that did not encode known proteins were clustered and assembled into a consensus DNA sequence with
the program "phrap" (Phil Green, University of Washington, Seattle, Washington). The consensus sequence
obtained therefrom is herein designated DNA56250.

In light of an observed sequence homology between the DNA56250 consensus sequence and an EST
sequence encompassed within the Incyte EST clone no. 1437250, the Incyte EST clone 1437250 was purchased
and the cDNA insert was obtained and sequenced. It was found that this insert encoded a full-length protein.
The sequence of this cDNA insert is shown in Figure 261 and is herein designated as DNA60615-1483.

Clone DNA60615-1483 contains a single open reading frame with an apparent translational initiation
site at nucleotide positions [illegible].
The predicted polypeptide precursor is 402 amino acids long (Figure 262). The full-length PRO1126 protein
shown in Figure 262 has an estimated molecular weight of about 45,921 daltons and a pI of about 8.60. Analysis
of the full-length PRO1126 sequence shown in Figure 262 (SEQ ID NO:367) evidences the presence of the
following: a signal peptide from about amino acid 1 to about amino acid 25 and potential N-glycosylation sites
from about amino acid 66 to about amino acid 69, from about amino acid 138 to about amino acid 141 and from
about amino acid 183 to about amino acid 186. Clone DNA60615-1483 has been deposited with ATCC on June
16, 1998 and is assigned ATCC deposit no. 209980.

An analysis of the Dayhoff database (version 35.45 SwissProt 35), using a WU-BLAST2 sequence
alignment analysis of the full-length sequence shown in Figure 262 (SEQ ID NO:367), evidenced significant
homology between the PRO1126 amino acid sequence and the following Dayhoff sequences: 173636,
NOMR_HUMAN, MMUSMYOC3_1, HS454G6_1, P_R98225, RNU78105_1, RNU72487_1, AF035301_1,
CEELC48E7_4 and CEF11C3_3.



EXAMPLE 117: Isolation of cDNA clones Encoding Human PRO1125

Use of the signal sequence algorithm described in Example 3 above allowed identification of a single
EST cluster sequence from the Incyte database. This EST cluster sequence was then compared to a variety of
expressed sequence tag (EST) databases which included public EST databases (e.g., GenBank) and a proprietary
EST DNA database (LIFESEQ®, Incyte Pharmaceuticals, Palo Alto, CA) to identify existing homologies. The
homology search was performed using the computer program BLAST or BLAST2 (Altschul et al., Methods in
Enzymology 266:460-480 (1996)). Those comparisons resulting in a BLAST score of 70 (or in some cases 90)
or greater that did not encode known proteins were clustered and assembled into a consensus DNA sequence with
the program "phrap" (Phil Green, University of Washington, Seattle, Washington). The consensus sequence
obtained therefrom is herein designated DNA56540.

In light of an observed sequence homology between the DNA56540 consensus sequence and an EST
sequence encompassed within the Incyte EST clone no. 1486114, the Incyte EST clone 1486114 was purchased
and the cDNA insert was obtained and sequenced. It was found that this insert encoded a full-length protein.
The sequence of this cDNA insert is shown in Figure 263 and is herein designated as DNA60619-1482.

The full length clone shown in Figure 263 contained a single open reading frame with an apparent
translational initiation site at nucleotide positions 47-49 and ending at the stop codon found at nucleotide positions
1388-1390 (Figure 263; SEQ ID NO:368). The predicted polypeptide precursor (Figure 264, SEQ ID NO:369)
is 447 amino acids long. PRO1125 has a calculated molecular weight of approximately 49,798 daltons and an
estimated pI of approximately 9.78. Clone DNA60619-1482 has been deposited with ATCC and is assigned
ATCC deposit no. 209993. It is understood that the clone has the actual sequence and that the sequences herein
are representations based on current techniques which may be prone to minor errors.

Based on a WU-BLAST2 sequence alignment analysis (using the ALIGN computer program) of the
full-length sequence, PRO1125 shows some sequence identity with the following Dayhoff designations:
RC01_NEUCR; S58306; PKWA_THECU; S76086; P_R85881; HET1_PODAN; SPU92792_1;
APAF_HUMAN; S76414 and S59317.



EXAMPLE 118: Isolation of cDNA clones Encoding Human PRO1186

Use of the signal sequence algorithm described in Example 3 above allowed identification of a single
EST cluster sequence from the Incyte database. This EST cluster sequence was then compared to a variety of
expressed sequence tag (EST) databases which included public EST databases (e.g., GenBank) and a proprietary
EST DNA database (LIFESEQ®, Incyte Pharmaceuticals, Palo Alto, CA) to identify existing homologies. The
homology search was performed using the computer program BLAST or BLAST2 (Altschul et al., Methods in
Enzymology 266:460-480 (1996)). Those comparisons resulting in a BLAST score of 70 (or in some cases 90)
or greater that did not encode known proteins were clustered and assembled into a consensus DNA sequence with
the program "phrap" (Phil Green, University of Washington, Seattle, Washington). The consensus sequence
obtained therefrom is herein designated DNA56748.

In light of an observed sequence homology between the DNA56748 consensus sequence and an EST
sequence encompassed within the Incyte EST clone no. 3476792, the Incyte EST clone 3476792 was purchased
and the cDNA insert was obtained and sequenced. It was found that this insert encoded a full-length protein.
The sequence of this cDNA insert is shown in Figure 265 and is herein designated as DNA60621-1516.

The full length clone shown in Figure 265 contained a single open reading frame with an apparent
translational initiation site at nucleotide positions 91-93 and ending at the stop codon found at nucleotide positions
406-408 (Figure 265; SEQ ID NO:370). The predicted polypeptide precursor (Figure 266, SEQ ID NO:371)
is 105 amino acids long. The signal peptide is at amino acids 1-19 of SEQ ID NO:371. PRO1186 has a
calculated molecular weight of approximately 11,715 daltons and an estimated pI of approximately 9.05. Clone
DNA60621-1516 was deposited with the ATCC on August 4, 1998 and is assigned ATCC deposit no. 203091.

An analysis of the Dayhoff database (version 35.45 SwissProt 35), using a WU-BLAST2 sequence
alignment analysis of the full-length sequence shown in Figure 266 (SEQ ID NO:371), revealed some sequence
identity between the PRO1186 amino acid sequence and the following Dayhoff sequences: VPRA_DENPO,
LFE4_CHICK, AF034208_1, AF030433_1, A55035, COL_RABIT, CELB0507_9, S67826_1, S34665 and
CRU73817_1.

EXAMPLE 119: Isolation of cDNA clones Encoding Human PRO1198

An initial DNA sequence referred to herein as DNA52083 was identified using a yeast screen in a 
human umbilical vein endothelial cell cDNA library that preferentially represents the 5' ends of the primary 
cDNA clones. DNA52083 was compared to ESTs from public databases (e.g., GenBank), and a proprietary 
EST database (LIFESEQ®, Incyte Pharmaceuticals, Palo Alto, CA), using the computer program BLAST or
BLAST2 [Altschul et al., Methods in Enzymology 266:460-480 (1996)]. The ESTs were clustered and
assembled into a consensus DNA sequence using the computer program "phrap" (Phil Green, University of
Washington, Seattle, Washington). One or more of the ESTs was obtained from human breast skin tissue
biopsy. This consensus sequence is designated herein as DNA52780.

In light of an observed sequence homology between the DNA52780 consensus sequence and an EST
sequence encompassed within the Incyte EST clone no. 3852910, the Incyte EST clone 3852910 was purchased
and the cDNA insert was obtained and sequenced. It was found that this insert encoded a full-length protein.
The sequence of this cDNA insert is shown in Figure 267 and is herein designated as DNA60622-1525.

The full length DNA60622-1525 clone shown in Figure 267 (SEQ ID NO:372) contained a single open
reading frame with an apparent translational initiation site at nucleotide positions 54 to 56 and ending at the stop
codon found at nucleotide positions 741 to 743. The predicted polypeptide precursor, which is shown in Figure
268 (SEQ ID NO:373), is 229 amino acids long. PRO1198 has a calculated molecular weight of approximately
25,764 daltons and an estimated pI of approximately 9.17. There is a signal peptide sequence at about amino
acids 1 through 34. There is sequence identity with glycosyl hydrolases family 31 protein at about amino acids
142 to about 175.

An analysis of the Dayhoff database (version 35.45 SwissProt 35), using a WU-BLAST2 sequence
alignment analysis of the full-length sequence shown in Figure 268 (SEQ ID NO:373), revealed some homology
between the PRO1198 amino acid sequence and the following Dayhoff sequences: ATF6H11_6, UCRI_RAT,
TOBSUP2NT_1, RCUERF3_1, AMU88186_1, P_W22485, S56579, AF040711_1 and DPP4_PIG.

Clone DNA60622-1525 has been deposited with the ATCC on August 4, 1998, and is assigned ATCC
deposit no. 203090.

EXAMPLE 120: Isolation of cDNA clones Encoding Human PRO1158

Use of the signal sequence algorithm described in Example 3 above allowed identification of a single 
EST cluster sequence from the Incyte database. This EST cluster sequence was then compared to a variety of
expressed sequence tag (EST) databases which included public EST databases (e.g., GenBank) and a proprietary
EST DNA database (LIFESEQ®, Incyte Pharmaceuticals, Palo Alto, CA) to identify existing homologies. The
homology search was performed using the computer program BLAST or BLAST2 (Altschul et al., Methods in
Enzymology 266:460-480 (1996)). Those comparisons resulting in a BLAST score of 70 (or in some cases 90)
or greater that did not encode known proteins were clustered and assembled into a consensus DNA sequence with
the program "phrap" (Phil Green, University of Washington, Seattle, Washington). The consensus sequence
obtained therefrom is herein designated DNA57248.

In light of an observed sequence homology between the DNA57248 consensus sequence and an EST
sequence encompassed within the Incyte EST clone no. 2640776, the Incyte EST clone 2640776 was purchased
and the cDNA insert was obtained and sequenced. It was found that this insert encoded a full-length protein.
The sequence of this cDNA insert is shown in Figure 269 and is herein designated as DNA60625-1507.

The full length clone shown in Figure 269 contained a single open reading frame with an apparent
translational initiation site at nucleotide positions 163 to 165 and ending at the stop codon found at nucleotide
positions 532 to 534 (Figure 269; SEQ ID NO:374). The predicted polypeptide precursor (Figure 270, SEQ
ID NO:375) is 123 amino acids long. PRO1158 has a calculated molecular weight of approximately 13,113
daltons and an estimated pI of approximately 8.53. Additional features include a signal peptide sequence at about
amino acids 1-19, a transmembrane domain at about amino acids 56-80, and a potential N-glycosylation site at
about amino acids 36-39. Clone DNA60625-1507 was deposited with the ATCC on June 16, 1998 and is
assigned ATCC deposit no. 209975.

An analysis of the Dayhoff database (version 35.45 SwissProt 35), using a WU-BLAST2 sequence 
alignment analysis of the full-length sequence shown in Figure 270 (SEQ ID NO:375), revealed some homology 
between the PRO1158 amino acid sequence and the following Dayhoff sequences: ATACOO31O51OF18A8.10,
P_R85151, PHS2_SOLTU, RNMHCIBAC_1, RNA1FMHC_1, 168771, RNRT1A10G_1, PTPA_HUMAN,
HUMGACA_1, and CHKPTPA_1.



EXAMPLE 121: Isolation of cDNA clones Encoding Human PRO1159

Use of the signal sequence algorithm described in Example 3 above allowed identification of a single
EST cluster sequence from the Incyte database. This EST cluster sequence was then compared to a variety of
expressed sequence tag (EST) databases which included public EST databases (e.g., GenBank) and a proprietary
EST DNA database (LIFESEQ®, Incyte Pharmaceuticals, Palo Alto, CA) to identify existing homologies. The
homology search was performed using the computer program BLAST or BLAST2 (Altschul et al., Methods in
Enzymology 266:460-480 (1996)). Those comparisons resulting in a BLAST score of 70 (or in some cases 90)
or greater that did not encode known proteins were clustered and assembled into a consensus DNA sequence with
the program "phrap" (Phil Green, University of Washington, Seattle, Washington). The consensus sequence
obtained therefrom is herein designated DNA57221.

In light of an observed sequence homology between the DNA57221 consensus sequence and an EST
sequence encompassed within the Incyte EST clone no. 376776, the Incyte EST clone 376776 was purchased
and the cDNA insert was obtained and sequenced. It was found that this insert encoded a full-length protein.
The sequence of this cDNA insert is shown in Figure 271 and is herein designated as DNA60627-1508.

Clone DNA60627-1508 contains a single open reading frame with an apparent translational initiation
site at nucleotide positions 92-94 and ending at the stop codon at nucleotide positions 362-364 (Figure 271). The
predicted polypeptide precursor is 90 amino acids long (Figure 272). The full-length PRO1159 protein shown
in Figure 272 has an estimated molecular weight of about 9,840 daltons and a pI of about 10.13. Analysis of
the full-length PRO1159 sequence shown in Figure 272 (SEQ ID NO:377) evidences the presence of the
following: a signal peptide from about amino acid 1 to about amino acid 15 and a potential N-glycosylation site
from about amino acid 38 to about amino acid 41. Clone DNA60627-1508 has been deposited with ATCC on
August 4, 1998 and is assigned ATCC deposit no. 203092.

An analysis of the Dayhoff database (version 35.45 SwissProt 35), using a WU-BLAST2 sequence
alignment analysis of the full-length sequence shown in Figure 272 (SEQ ID NO:377), evidenced significant
homology between the PRO1159 amino acid sequence and the following Dayhoff sequences: AF016494_6,
AF036708_20, DSSCUTE_1, D89100_1, S28060, MEFA_XENLA, AF020798_12, G70065, E64423 and JO2005.

EXAMPLE 122: Isolation of cDNA clones Encoding Human PRO1124

Use of the signal sequence algorithm described in Example 3 above allowed identification of a single 
EST cluster sequence from the Incyte database. This EST cluster sequence was then compared to a variety of
expressed sequence tag (EST) databases which included public EST databases (e.g., GenBank) and a proprietary
EST DNA database (LIFESEQ®, Incyte Pharmaceuticals, Palo Alto, CA) to identify existing homologies. The
homology search was performed using the computer program BLAST or BLAST2 (Altschul et al., Methods in
Enzymology 266:460-480 (1996)). Those comparisons resulting in a BLAST score of 70 (or in some cases 90)
or greater that did not encode known proteins were clustered and assembled into a consensus DNA sequence with
the program "phrap" (Phil Green, University of Washington, Seattle, Washington). The consensus sequence
obtained therefrom is herein designated DNA56035. 

In light of an observed sequence homology between the DNA56035 consensus sequence and an EST 
sequence encompassed within the Incyte EST clone no. 2767646, the Incyte EST clone 2767646 was purchased 
and the cDNA insert was obtained and sequenced. It was found that this insert encoded a full-length protein. 
The sequence of this cDNA insert is shown in Figure 273 and is herein designated as DNA60629-1481. 

The full length clone shown in Figure 273 contained a single open reading frame with an apparent 
translational initiation site at nucleotide positions 25-27 and ending at the stop codon found at nucleotide positions 
2782-2784 (Figure 273; SEQ ID NO:378). The predicted polypeptide precursor (Figure 274, SEQ ID NO:379)
is 919 amino acids long. PRO1124 has a calculated molecular weight of approximately 101,282 daltons and an
estimated pI of approximately 5.37. Clone DNA60629-1481 has been deposited with the ATCC and is assigned
ATCC deposit no. 209979. It is understood that the deposited clone has the actual sequence, whereas only
representations based on current sequencing techniques, which may include normal and minor errors, are
provided herein.

Based on a WU-BLAST2 sequence alignment analysis of the full-length sequence, PRO1124 shows
significant amino acid sequence identity to a chloride channel protein and to ECAM-1. Specifically, the
following Dayhoff designations were identified as having sequence identity with PRO1124: ECLC_BOVIN,
AF001261_1, P_W06548, SSC6A10_1, AF004355_1, S76691, AF017642, BYU06866_2, CSA_DICDI and
SAU47139_2.

EXAMPLE 123: Isolation of cDNA clones Encoding Human PRO1287

An expressed sequence tag (EST) DNA database (LIFESEQ®, Incyte Pharmaceuticals, Palo Alto, CA)
was searched and an EST was identified which showed homology to the fringe protein. This EST sequence was
then compared to various EST databases including public EST databases (e.g., GenBank), and a proprietary EST
database (LIFESEQ®, Incyte Pharmaceuticals, Palo Alto, CA) to identify homologous EST sequences. The
comparison was performed using the computer program BLAST or BLAST2 [Altschul et al., Methods in
Enzymology 266:460-480 (1996)]. Those comparisons resulting in a BLAST score of 70 (or in some cases,
90) or greater that did not encode known proteins were clustered and assembled into a consensus DNA sequence
with the program "phrap" (Phil Green, University of Washington, Seattle, Washington). This consensus
sequence obtained is herein designated DNA40568.

Based on the DNA40568 consensus sequence, oligonucleotides were synthesized: 1) to identify by PCR
a cDNA library that contained the sequence of interest, and 2) for use as probes to isolate a clone of the
full-length coding sequence for PRO1287. Forward and reverse PCR primers generally range from 20 to 30
nucleotides and are often designed to give a PCR product of about 100-1000 bp in length. The probe sequences
are typically 40-55 bp in length. In some cases, additional oligonucleotides are synthesized when the consensus
sequence is greater than about 1-1.5 kbp. In order to screen several libraries for a full-length clone, DNA from
the libraries was screened by PCR amplification, as per Ausubel et al., Current Protocols in Molecular Biology,
supra, with the PCR primer pair. A positive library was then used to isolate clones encoding the gene of interest
using the probe oligonucleotide and one of the primer pairs.

PCR primers (forward and reverse) were synthesized:

forward PCR primer 5'-CTCGGGGAAAGGGACTTGATGTTGG-3' (SEQ ID NO:382)
reverse PCR primer 1 5'-GCGAAGGTGAGCCTCTATCTCGTGCC-3' (SEQ ID NO:383)
reverse PCR primer 2 5'-CAGCCTACACGTATTGAGG-3' (SEQ ID NO:384)

Additionally, a synthetic oligonucleotide hybridization probe was constructed from the consensus DNA40568
sequence which had the following nucleotide sequence:
hybridization probe
5'-CAGTCAGTACAATCCTGGCATAATATACGGCCACCATGATGCAGTCCC-3' (SEQ ID NO:385).
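
The oligonucleotides listed above can be compared against the general design ranges stated earlier in this Example (primers of about 20 to 30 nucleotides, probes of about 40-55 bp). The following is a minimal sketch of such a check; the helper names are hypothetical, and the GC fraction and reverse complement are included only as commonly inspected properties, not as criteria stated in the text.

```python
# Sequences as listed for SEQ ID NOS:382-385 (written 5' to 3').
OLIGOS = {
    "forward primer":   "CTCGGGGAAAGGGACTTGATGTTGG",
    "reverse primer 1": "GCGAAGGTGAGCCTCTATCTCGTGCC",
    "reverse primer 2": "CAGCCTACACGTATTGAGG",
    "probe":            "CAGTCAGTACAATCCTGGCATAATATACGGCCACCATGATGCAGTCCC",
}

COMPLEMENT = str.maketrans("ACGT", "TGCA")


def reverse_complement(seq):
    """Return the reverse complement of a DNA sequence."""
    return seq.translate(COMPLEMENT)[::-1]


def gc_fraction(seq):
    """Return the fraction of G and C bases in the sequence."""
    return (seq.count("G") + seq.count("C")) / len(seq)


def within(seq, low, high):
    """True if the oligonucleotide length falls within [low, high]."""
    return low <= len(seq) <= high


for name, seq in OLIGOS.items():
    low, high = (40, 55) if name == "probe" else (20, 30)
    print(f"{name}: {len(seq)} nt, GC {gc_fraction(seq):.2f}, "
          f"in typical range: {within(seq, low, high)}")
```

Under these ranges, reverse PCR primer 2 (19 nucleotides) falls just below the general 20 to 30 nucleotide range, which is consistent with the text describing that range as general rather than strict.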

In order to screen several libraries for a source of a full-length clone, DNA from the libraries was
screened by PCR amplification with the PCR primer pairs identified above. A positive library was then used
to isolate clones encoding the PRO1287 gene using the probe oligonucleotide and one of the PCR primers.

RNA for construction of the cDNA libraries was isolated from human bone marrow tissue. The cDNA
libraries used to isolate the cDNA clones were constructed by standard methods using commercially available
reagents such as those from Invitrogen, San Diego, CA. The cDNA was primed with oligo dT containing a NotI
site, linked with blunt to SalI hemikinased adaptors, cleaved with NotI, sized appropriately by gel
electrophoresis, and cloned in a defined orientation into a suitable cloning vector (such as pRKB or pRKD;
pRK5B is a precursor of pRK5D that does not contain the SfiI site; see, Holmes et al., Science, 252:1278-1280
(1991)) in the unique XhoI and NotI sites.

WO 99/63088

PCT/US99/12252

DNA sequencing of the clones isolated as described above gave the full-length DNA sequence for
PRO1287 (designated herein as DNA61755-1554 [Figure 275, SEQ ID NO:380]) and the derived protein
sequence for PRO1287.

The entire nucleotide sequence of DNA61755-1554 is shown in Figure 275 (SEQ ID NO:380). The 
full length clone contained a single open reading frame with an apparent translation^ initiation site at nucleotide 
positions 655-657 and a stop signal at nucleotide positions 2251-2253 (Figure 275, SEQ ID NO:380). The 
predicted polypeptide precursor is 532 amino acids long, has a calculated molecular weight of approximately 
61,351 daltons and an estimated pi of approximately 8.77. Analysis of the full-length PR01287 sequence shown 
in Figure 276 (SEQ ID NO:381) evidences the presence of the following: a signal peptide from about amino acid 
1 to about amino acid 27 and potential N-glycosylation sites from about amino acid 315 to about amino acid 318 
and from about amino acid 324 to about amino acid 327. Clone DNA61755-1554 has been deposited with 
ATCC on August 1 1 , 1998 and b assigned ATCC deposit no. 2031 12. 
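
The reading-frame coordinates reported above can be cross-checked against the stated polypeptide length: the codons from the initiation codon up to, but not including, the stop codon should number exactly the reported amino acids. The sketch below assumes 1-based nucleotide positions as used in the text; the function name is illustrative and not taken from the source.

```python
def orf_length_aa(start_codon_first_nt, stop_codon_first_nt):
    """Number of amino acids encoded between a start codon and a stop codon.

    Both arguments are the 1-based position of the first nucleotide of the
    respective codon. The stop codon itself is not translated.
    """
    span = stop_codon_first_nt - start_codon_first_nt
    if span <= 0 or span % 3 != 0:
        raise ValueError("start and stop codons are not in the same frame")
    return span // 3


# DNA61755-1554: initiation at 655-657, stop at 2251-2253.
print(orf_length_aa(655, 2251))  # 532, matching the 532-residue precursor
```

The same check can be applied to the other clones in these examples wherever both coordinates are given.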

An analysis of the Dayhoff database (version 35.45 SwissProt 35), using a WU-BLAST2 sequence
alignment analysis of the full-length sequence shown in Figure 276 (SEQ ID NO:381), evidenced significant
homology between the PRO1287 amino acid sequence and the following Dayhoff sequences: CET24D1_1,
EZRI_BOVIN, GGU19889J, CC3_YEAST, S74244, NALS_MOUSE, MOES_PIG, S28660, S44860 and
YNA4_CAEEL.

EXAMPLE 124: Isolation of cDNA clones Encoding Human PRO1312

DNA55773 was identified in a human fetal kidney cDNA library using a yeast screen that preferentially
represents the 5' ends of the primary cDNA clones. Based on the DNA55773 sequence, oligonucleotides were
synthesized for use as probes to isolate a clone of the full-length coding sequence for PRO1312.

The full length DNA61873-1574 clone shown in Figure 277 (SEQ ID NO:386) contained a single open
reading frame with an apparent translational initiation site at nucleotide positions 7-9 and ending at the stop
codon found at nucleotide positions 643-645. The predicted polypeptide precursor is 212 amino acids long
(Figure 278, SEQ ID NO:387). PRO1312 has a calculated molecular weight of approximately 24,024 daltons
and an estimated pI of approximately 6.26. Other features include a signal peptide at about amino acids 1-14,
a transmembrane domain at about amino acids 141-160, and potential N-glycosylation sites at about amino acids
76-79 and 93-96.
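
Potential N-glycosylation sites of the kind listed above are commonly predicted with the four-residue consensus pattern N-{P}-[ST]-{P} (asparagine, any residue except proline, serine or threonine, any residue except proline), which is consistent with the four-residue spans reported here. Whether this exact pattern was used in the present analysis is an assumption; the sketch below simply scans a protein sequence for it and reports 1-based coordinates. The example sequence is made up for illustration and is not taken from any figure.

```python
import re

# Lookahead so that overlapping candidate sites are all reported.
SEQUON = re.compile(r"(?=(N[^P][ST][^P]))")


def n_glycosylation_sites(protein):
    """Return 1-based (first, last) residue positions of N-{P}-[ST]-{P} matches."""
    protein = protein.upper()
    return [(m.start() + 1, m.start() + 4) for m in SEQUON.finditer(protein)]


# Hypothetical toy sequence: N3-G4-S5-A6 matches, N8-P9-T10 does not (proline).
print(n_glycosylation_sites("MANGSAKNPTL"))  # [(3, 6)]
```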

An analysis of the Dayhoff database (version 35.45 SwissProt 35), using a WU-BLAST2 sequence 
alignment analysis of the full-length sequence shown in Figure 278 (SEQ ID NO:387), revealed some homology 
between the PRO1312 amino acid sequence and the following Dayhoff sequences: GCINTALPH1,
GIBMUC1A_1, P_R96298, AF001406J, PVU88874J, P_R85151, AF041409J, CELC50F2_7, C45875,
and AB009510_21.

Clone DNA61873-1574 has been deposited with ATCC and is assigned ATCC deposit no. 203132.

EXAMPLE 125: Isolation of cDNA clones Encoding Human PRO1192

A consensus DNA sequence was assembled relative to other EST sequences using phrap as described 
in Example 1 above. This consensus sequence is designated herein DNA35924. Based on the DNA35924 
consensus sequence, oligonucleotides were synthesized: 1) to identify by PCR a cDNA library that contained 
the sequence of interest, and 2) for use as probes to isolate a clone of the full-length coding sequence for 
PRO1192.

PCR primers (forward and reverse) were synthesized: 
forward PCR primer: 5'-CCGAGGCCATCTAGAGGCCAGAGW (SEQ ID NO:390)
reverse PCR primer: 5'-ACAGGCAGAGCCAATGGCCAGAGC-3' (SEQ ID NO:391).

Additionally, a synthetic oligonucleotide hybridization probe was constructed from the consensus
DNA35924 sequence which had the following nucleotide sequence:
hybridization probe:

5'-GAGA(K}ACTGCGGGAGTTTGGGACC^TTGTGCAGACGTGCTCATG-3' (SEQ ID NO:392).

In order to screen several libraries for a source of a full-length clone, DNA from the libraries was
screened by PCR amplification with the PCR primer pair identified above. A positive library was then used to
isolate clones encoding the PRO1192 gene using the probe oligonucleotide and one of the PCR primers. RNA
for construction of the cDNA libraries was isolated from human fetal liver and spleen tissue.
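
The library screen described above amounts to asking whether both primer binding sites occur in a library's sequences in the correct orientation, so that a product can be amplified. A minimal in-silico sketch of that idea follows. It assumes exact primer matches on a single template strand, and the library names and toy sequences are hypothetical; it is an illustration only and not the laboratory procedure itself.

```python
COMPLEMENT = str.maketrans("ACGT", "TGCA")


def reverse_complement(seq):
    """Return the reverse complement of a DNA sequence."""
    return seq.upper().translate(COMPLEMENT)[::-1]


def predicted_amplicon(template, forward, reverse):
    """Return the predicted PCR product, or None if either primer site is absent.

    The forward primer must match the template strand; the reverse primer
    must match the opposite strand downstream of the forward site.
    """
    template = template.upper()
    start = template.find(forward.upper())
    if start == -1:
        return None
    reverse_site = reverse_complement(reverse)
    site_start = template.find(reverse_site, start + len(forward))
    if site_start == -1:
        return None
    return template[start:site_start + len(reverse_site)]


def positive_libraries(libraries, forward, reverse):
    """Names of libraries (name -> sequence) that yield a predicted product."""
    return [name for name, seq in libraries.items()
            if predicted_amplicon(seq, forward, reverse) is not None]


# Hypothetical toy data for illustration only.
toy_libraries = {"lib1": "TTTATGGCCAAAAACCGTAATTT", "lib2": "ACGTACGT"}
print(positive_libraries(toy_libraries, "ATGGCC", "TTACGG"))  # ['lib1']
```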

DNA sequencing of the clones isolated as described above gave the full-length DNA sequence for
PRO1192, designated herein as DNA62814-1521 and shown in Figure 279 (SEQ ID NO:388), and the derived
protein sequence for PRO1192, which is shown in Figure 280 (SEQ ID NO:389).

The entire coding sequence of PRO1192 is shown in Figure 279 (SEQ ID NO:388). Clone DNA62814-1521
contains a single open reading frame with an apparent translational initiation site at nucleotide positions
121-123 and an apparent stop codon at nucleotide positions [...]. The predicted polypeptide precursor is 215
amino acids long. The predicted polypeptide precursor has the following features: a signal peptide at about
amino acids 1-21; a transmembrane domain at about amino acids 153-176; potential N-glycosylation sites at
about amino acids 39-42 and 118-121; and homology with myelin P0 proteins at about amino acids 27-68 and
99-128 of Figure 280. The full-length PRO1192 protein shown in Figure 280 has an estimated molecular weight
of about 24,484 daltons and a pI of about 6.98.

An analysis of the Dayhoff database (version 35.45 SwissProt 35), using a WU-BLAST2 sequence
alignment analysis of the full-length sequence shown in Figure 280 (SEQ ID NO:389), revealed homology
between the PRO1192 amino acid sequence and the following Dayhoff sequences: GEN12838, MYP0_HUMAN,
AF049498J, GEN14531, P_W14146, HS46KDAJ, CINBRAT, OX2G_RAT, D87018J, and D86996_2.

Clone DNA62814-1521 was deposited with the ATCC on August 4, 1998, and is assigned ATCC
deposit no. 203093.

EXAMPLE 126: Isolation of cDNA clones Encoding Human PRO1160

A consensus DNA sequence was assembled relative to other EST sequences using phrap as described
in Example 1 above. This consensus sequence is herein designated DNA40650. Based on the DNA40650
consensus sequence, oligonucleotides were synthesized: 1) to identify by PCR a cDNA library that contained
the sequence of interest, and 2) for use as probes to isolate a clone of the full-length coding sequence for
PRO1160.

PCR primers (forward and reverse) were synthesized: 
forward PCR primer 5'-GCTCCCTGATCTTCATGTCACCACC-3' (SEQ ID NO:395)
reverse PCR primer 5'-CAOOGACACACTCTACCATTCGGGAG-3' (SEQ ID NO:396)
Additionally, a synthetic oligonucleotide hybridization probe was constructed from the consensus DNA40650
sequence which had the following nucleotide sequence
hybridization probe

5'-CATCITTCTGGTCrCTGCCCAGAATCCGACAACAGCTGCTC-3' (SEQ ID NO:397)

In order to screen several libraries for a source of a full-length clone, DNA from the libraries was
screened by PCR amplification with the PCR primer pair identified above. A positive library was then used to
isolate clones encoding the PRO1160 gene using the probe oligonucleotide and one of the PCR primers. RNA
for construction of the cDNA libraries was isolated from human breast tissue.

DNA sequencing of the clones isolated as described above gave the full-length DNA sequence for
PRO1160 (designated herein as DNA62872-1509 [Figure 281, SEQ ID NO:393]) and the derived protein
sequence for PRO1160.

The entire nucleotide sequence of DNA62872-1509 is shown in Figure 281 (SEQ ID NO:393). Clone
DNA62872-1509 contains a single open reading frame with an apparent translational initiation site at nucleotide
positions 40-42 and ending at the stop codon at nucleotide positions 310-312 (Figure 281). The predicted
polypeptide precursor is 90 amino acids long (Figure 282). The full-length PRO1160 protein shown in Figure
282 has an estimated molecular weight of about 9,039 daltons and a pI of about 4.37. Analysis of the full-length
PRO1160 sequence shown in Figure 282 (SEQ ID NO:394) evidences the presence of the following: a signal
peptide from about amino acid 1 to about amino acid 19 and a protein kinase C phosphorylation site from about
amino acid 68 to about amino acid 70. Clone DNA62872-1509 has been deposited with ATCC on August 4,
1998 and is assigned ATCC deposit no. 203100.

An analysis of the Dayhoff database (version 35.45 SwissProt 35), using a WU-BLAST2 sequence
alignment analysis of the full-length sequence shown in Figure 282 (SEQ ID NO:394), evidenced significant
homology between the PRO1160 amino acid sequence and the following Dayhoff sequences: B30305,
GEN13490, 153641, S53363, HA34_BRELC, SP96_DICDI, S36326, SSU51197_10, MUC1_XENLA,
TCU32448J and AF000409J.

EXAMPLE 127: Isolation of cDNA clones Encoding Human PRO1187

Use of the signal sequence algorithm described in Example 3 above allowed identification of a single 
EST cluster sequence from the Incyte database. This EST cluster sequence was then compared to a variety of
expressed sequence tag (EST) databases which included public EST databases (e.g., GenBank) and a proprietary
EST DNA database (LIFESEQ®, Incyte Pharmaceuticals, Palo Alto, CA) to identify existing homologies. The
homology search was performed using the computer program BLAST or BLAST2 (Altschul et al., Methods in
Enzymology 266:460-480 (1996)). Those comparisons resulting in a BLAST score of 70 (or in some cases 90)
or greater that did not encode known proteins were clustered and assembled into a consensus DNA sequence with
the program "phrap" (Phil Green, University of Washington, Seattle, Washington). The consensus sequence
obtained therefrom is herein designated DNA57726.
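
The selection rule described above (keep comparisons scoring at or above a cutoff of 70, or 90 in some cases, that do not encode known proteins) can be expressed as a simple filter applied before assembly. The sketch below is illustrative only: the `Hit` record, its field names, and the example values are hypothetical and do not reflect the output format of BLAST or the input format of phrap.

```python
from dataclasses import dataclass


@dataclass
class Hit:
    est_id: str
    blast_score: float
    encodes_known_protein: bool


def select_for_assembly(hits, min_score=70):
    """Keep hits at or above the score cutoff that do not encode known proteins."""
    return [h for h in hits
            if h.blast_score >= min_score and not h.encodes_known_protein]


# Hypothetical hits for illustration only.
hits = [
    Hit("est_a", 95.0, False),
    Hit("est_b", 72.0, True),
    Hit("est_c", 70.0, False),
    Hit("est_d", 55.0, False),
]
print([h.est_id for h in select_for_assembly(hits)])      # ['est_a', 'est_c']
print([h.est_id for h in select_for_assembly(hits, 90)])  # ['est_a']
```

The retained sequences would then be passed to the assembler to build the consensus sequence.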

In light of an observed sequence homology between the DNA57726 consensus sequence and an EST
sequence encompassed within the Incyte EST clone no. 358563, the Incyte EST clone 358563 was purchased
and the cDNA insert was obtained and sequenced. It was found that this insert encoded a full-length protein.
The sequence of this cDNA insert is shown in Figure 283 and is herein designated as DNA62876-1517.

The full length clone shown in Figure 283 contained a single open reading frame with an apparent 
translational initiation site at nucleotide positions 121-123 and ending at the stop codon found at nucleotide 
positions 481-483 (Figure 283; SEQ ID NO:398). The predicted polypeptide precursor (Figure 284, SEQ ID 
NO:399) is 120 amino acids long. The signal peptide is at about amino acids 1-17 of SEQ ID NO:399. 
PRO1187 has a calculated molecular weight of approximately 12,925 daltons and an estimated pI of
approximately 9.46. Clone DNA62876-1517 was deposited with the ATCC on August 4, 1998 and is assigned 
ATCC deposit no. 203095. It is understood that the deposited clone contains the actual sequence and that the 
representations herein may have minor sequencing errors. 

An analysis of the Dayhoff database (version 35.45 SwissProt 35), using a WU-BLAST2 sequence 
alignment analysis of the full-length sequence shown in Figure 284 (SEQ ID NO:399), revealed some sequence
identity (and therefore some relation) between the PRO1187 amino acid sequence and the following Dayhoff

sequences:— MGNEMX)BX-lT—CEIJ41G3-9—AMro^ 

AF029958J and P_W04957. 

EXAMPLE 128: Isolation of cDNA clones Encoding Human PRO1185

Use of the signal sequence algorithm described in Example 3 above allowed identification of a single 
EST cluster sequence from the Incyte database. This EST cluster sequence was then compared to a variety of
expressed sequence tag (EST) databases which included public EST databases (e.g., GenBank) and a proprietary
EST DNA database (LIFESEQ®, Incyte Pharmaceuticals, Palo Alto, CA) to identify existing homologies. The
homology search was performed using the computer program BLAST or BLAST2 (Altschul et al., Methods in
Enzymology 266:460-480 (1996)). Those comparisons resulting in a BLAST score of 70 (or in some cases 90)
or greater that did not encode known proteins were clustered and assembled into a consensus DNA sequence with 
the program "phrap" (Phil Green, University of Washington, Seattle, Washington). The consensus sequence 
obtained therefrom is herein designated DNA56426. 

In light of an observed sequence homology between the DNA56426 consensus sequence and an EST
sequence encompassed within the Incyte EST clone no. 3284411, the Incyte EST clone 3284411 was purchased
and the cDNA insert was obtained and sequenced. It was found that this insert encoded a full-length protein.
The sequence of this cDNA insert is shown in Figure 285 and is herein designated as DNA62881-1515.

The full length DNA62881-1515 clone shown in Figure 285 contained a single open reading frame with
an apparent translational initiation site at nucleotide positions 4-6 and ending at the stop codon found at
nucleotide positions 598-600 (Figure 285; SEQ ID NO:400). The predicted polypeptide precursor (Figure 286,
SEQ ID NO:401) is 198 amino acids long. The signal peptide is at about amino acids 1-21 of SEQ ID NO:401.
PRO1185 has a calculated molecular weight of approximately 22,105 daltons and an estimated pI of
approximately 7.73. Clone DNA62881-1515 has been deposited with the ATCC and is assigned ATCC deposit 
no. 203096. 

An analysis of the Dayhoff database (version 35.45 SwissProt 35), using a WU-BLAST2 sequence 
alignment analysis of the full-length sequence shown in Figure 286 (SEQ ID NO:401), revealed some sequence 
identity between the PRO1185 amino acid sequence and the following Dayhoff sequences: TUP1_YEAST,
AF041382J, MAOM_SOLTU, SPPBPHU9J, 141024, EPCPLCFA1LJ, HSPLECJ, YKL4_CAEEL,
A44643, TGU65922J.

EXAMPLE 129: Isolation of cDNA clones Encoding Human PRO1345

A consensus DNA sequence was assembled relative to other EST sequences using phrap as described
in Example 1 above. This consensus sequence is herein designated DNA47364. Based on the DNA47364
consensus sequence, oligonucleotides were synthesized: 1) to identify by PCR a cDNA library that contained
the sequence of interest, and 2) for use as probes to isolate a clone of the full-length coding sequence for
PRO1345.

PCR primers (forward and reverse) were synthesized:
forward PCR primer 5'-CCTGGTTATCCCCAGGAACTCCGAC-3' (SEQ ID NO:404)
reverse PCR primer 5'-CTCTTGCTGCTGCGACAGGCCTC-3' (SEQ ID NO:405)
Additionally, a synthetic oligonucleotide hybridization probe was constructed from the consensus DNA47364
sequence which had the following nucleotide sequence
hybridization probe

5'-CGCCCTCCAAGACTATGGTAAAAGGAGCCTGCCAGGTGTCAATGAC-3' (SEQ ID NO:406)

In order to screen several libraries for a source of a full-length clone, DNA from the libraries was 
screened by PCR amplification with the PCR primer pair identified above. A positive library was then used to 
isolate clones encoding the PRO1345 gene using the probe oligonucleotide and one of the PCR primers. RNA
for construction of the cDNA libraries was isolated from human breast carcinoma tissue.

DNA sequencing of the clones isolated as described above gave the full-length DNA sequence for 
PRO1345 (designated herein as DNA64852-1589 [Figure 287, SEQ ID NO:402]) and the derived protein
sequence for PRO1345.

The entire nucleotide sequence of DNA64852-1589 is shown in Figure 287 (SEQ ID NO:402). Clone
DNA64852-1589 contains a single open reading frame with an apparent translational initiation site at nucleotide
positions 7-9 or 34-36 and ending at the stop codon at nucleotide positions 625-627 (Figure 287). The predicted
polypeptide precursor is 206 amino acids long (Figure 288). The full-length PRO1345 protein shown in Figure
288 has an estimated molecular weight of about 23,190 daltons and a pI of about 9.40. Analysis of the
full-length PRO1345 sequence shown in Figure 288 (SEQ ID NO:403) evidences the presence of the following: a
signal peptide from about amino acid 1 to about amino acid 31 or from about amino acid 10 to about amino acid
31 and a C-type lectin domain signature sequence from about amino acid 176 to about amino acid 190. Clone
DNA64852-1589 has been deposited with ATCC on August 18, 1998 and is assigned ATCC deposit no. 203127.
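
The two alternative initiation sites given above (nucleotides 7-9 or 34-36) lie in the same reading frame, which is why the signal peptide is described as starting at either amino acid 1 or amino acid 10. The small sketch below maps an in-frame codon position to its residue number, counting from the first initiation site; the function name is illustrative and positions are assumed to be 1-based as in the text.

```python
def residue_number(first_start_nt, codon_first_nt):
    """1-based residue index of the codon beginning at codon_first_nt,
    counting the codon at first_start_nt as residue 1."""
    offset = codon_first_nt - first_start_nt
    if offset < 0 or offset % 3 != 0:
        raise ValueError("codon is not in frame with the first start codon")
    return offset // 3 + 1


# Second candidate initiation site of DNA64852-1589 relative to the first.
print(residue_number(7, 34))  # 10
```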

An analysis of the Dayhoff database (version 35.45 SwissProt 35), using a WU-BLAST2 sequence
alignment analysis of the full-length sequence shown in Figure 288 (SEQ ID NO:403), evidenced significant
homology between the PRO1345 amino acid sequence and the following Dayhoff sequences: BTU22298_1,
TETNCARSP, TETN_HUMAN, MABA_RAT, S34198, P_W13144, MACMBPAJ, A46274, PSPD_RAT
and P_R32188.

EXAMPLE 130: Isolation of cDNA clones Encoding Human PRO1245

Use of the signal sequence algorithm described in Example 3 above allowed identification of a single
EST cluster sequence from the Incyte database. This EST cluster sequence was then compared to a variety of
expressed sequence tag (EST) databases which included public EST databases (e.g., GenBank) and a proprietary
EST DNA database (LIFESEQ®, Incyte Pharmaceuticals, Palo Alto, CA) to identify existing homologies. The
homology search was performed using the computer program BLAST or BLAST2 (Altschul et al., Methods in
Enzymology 266:460-480 (1996)). Those comparisons resulting in a BLAST score of 70 (or in some cases 90)
or greater that did not encode known proteins were clustered and assembled into a consensus DNA sequence with
the program "phrap" (Phil Green, University of Washington, Seattle, Washington). The consensus sequence
obtained therefrom is herein designated DNA56019.

In light of an observed sequence homology between the DNA56019 consensus sequence and an EST
sequence encompassed within the Incyte EST clone no. 1327836, the Incyte EST clone 1327836 was purchased
and the cDNA insert was obtained and sequenced. It was found that this insert encoded a full-length protein.
The sequence of this cDNA insert is shown in Figure 289 and is herein designated as DNA64884-1527.

The full length clone shown in Figure 289 contained a single open reading frame with an apparent
translational initiation site at nucleotide positions 79-81 and ending at the stop codon found at nucleotide positions
391-393 (Figure 289; SEQ ID NO:407). The predicted polypeptide precursor (Figure 290, SEQ ID NO:408)
is 104 amino acids long, with a signal peptide sequence at about amino acid 1 to about amino acid 18.
PRO1245 has a calculated molecular weight of approximately 10,100 daltons and an estimated pI of
approximately 8.76.

An analysis of the Dayhoff database (version 35.45 SwissProt 35), using a WU-BLAST2 sequence 
alignment analysis of the full-length sequence shown in Figure 290 (SEQ ID NO:408), revealed some homology 
between the PRO1245 amino acid sequence and the following Dayhoff sequences: SYA_THETH, GEN11167,
MTV044_4, AB011151J, RLAJ2750_3, SNEUPTRAJ, S63624, C28391, A37907, and S14064.

Clone DNA64884-1245 was deposited with the ATCC on August 25, 1998 and is assigned ATCC
deposit no. 203155.

EXAMPLE 131: Isolation of cDNA clones Encoding Human PRO1358

Use of the signal sequence algorithm described in Example 3 above allowed identification of a single 
EST cluster sequence from the Incyte database. This EST cluster sequence was then compared to a variety of 
expressed sequence tag (EST) databases which included public EST databases (e.g., GenBank) and a proprietary 
EST DNA database (LIFESEQ®, Incyte Pharmaceuticals, Palo Alto, CA) to identify existing homologies. The
homology search was performed using the computer program BLAST or BLAST2 (Altschul et al., Methods in
Enzymology 266:460-480 (1996)). Those comparisons resulting in a BLAST score of 70 (or in some cases 90)
or greater that did not encode known proteins were clustered and assembled into a consensus DNA sequence with 
the program "phrap" (Phil Green, University of Washington, Seattle, Washington). 

In light of an observed sequence homology between the consensus sequence and an EST sequence 
encompassed within the Incyte EST clone no. 88718, the Incyte EST clone 88718 was purchased and the cDNA 
insert was obtained and sequenced. It was found that this insert encoded a full-length protein. The sequence 
of this cDNA insert is shown in Figure 291 and is herein designated as DNA64890-1612. 

The full length clone shown in Figure 291 contained a single open reading frame with an apparent 
translational initiation site at nucleotide positions 86 through 88 and ending at the stop codon found at nucleotide 
positions 1418 through 1420 (Figure 291; SEQ ID NO:409). The predicted polypeptide precursor (Figure 292, 
SEQ ID NO:410) is 444 amino acids long. The signal peptide is at about amino acids 1-18 of SEQ ID NO:410. 
PRO1358 has a calculated molecular weight of approximately 50,719 daltons and an estimated pI of
approximately 8.82. Clone DNA64890-1612 was deposited with the ATCC on August 18, 1998 and is assigned 
ATCC deposit no. 203131. 

An analysis of the Dayhoff database (version 35.45 SwissProt 35), using a WU-BLAST2 sequence 
alignment analysis of the full-length sequence shown in Figure 292 (SEQ ID NO:410), revealed sequence identity 
between the PRO1358 amino acid sequence and the following Dayhoff sequences: P_W07607, AB000545_1,
AB000546J, A1AT_RAT, AB015164_1, P_P50021, COTR_CAVPO, and HAMHPP_1. The variants claimed
in this application exclude these sequences. 

EXAMPLE 132: Isolation of cDNA clones Encoding Human PRO1195

Use of the signal sequence algorithm described in Example 3 above allowed identification of a single 
EST cluster sequence from the Incyte database. This EST cluster sequence was then compared to a variety of 
expressed sequence tag (EST) databases which included public EST databases (e.g., GenBank) and a proprietary
EST DNA database (LIFESEQ®, Incyte Pharmaceuticals, Palo Alto, CA) to identify existing homologies. The
homology search was performed using the computer program BLAST or BLAST2 (Altschul et al., Methods in
Enzymology 266:460-480 (1996)). Those comparisons resulting in a BLAST score of 70 (or in some cases 90)
or greater that did not encode known proteins were clustered and assembled into a consensus DNA sequence with
the program "phrap" (Phil Green, University of Washington, Seattle, Washington). The consensus sequence
obtained therefrom is herein designated DNA55716.

In light of an observed sequence homology between the DNA55716 consensus sequence and an EST 
sequence encompassed within the Incyte EST clone no. 3252980, the Incyte EST clone 3252980 was purchased
and the cDNA insert was obtained and sequenced. It was found that this insert encoded a full-length protein.
The sequence of this cDNA insert is shown in Figure 293 and is herein designated as DNA65412-1523. 

The full length clone shown in Figure 293 contained a single open reading frame with an apparent 
translational initiation site at nucleotide positions 58-60 and ending at the stop codon found at nucleotide positions 
511-513 (Figure 293; SEQ ID NO:411). The predicted polypeptide precursor (Figure 294, SEQ ID NO:412)
is 151 amino acids long. The signal sequence is at about amino acids 1-22 of SEQ ID NO:412. PRO1195 has
a calculated molecular weight of approximately 17,277 daltons and an estimated pI of approximately 5.33. Clone
DNA65412-1523 was deposited with the ATCC on August 4, 1998 and is assigned ATCC deposit no. 203094. 

An analysis of the Dayhoff database (version 35.45 SwissProt 35), using a WU-BLAST2 sequence 
alignment analysis of the full-length sequence shown in Figure 294 (SEQ ID NO:412), revealed some sequence 
identity between the PRO1195 amino acid sequence and the following Dayhoff sequences: MMU28486_1,
AF044205J, P_W31186, CELK03C7J, F69034, EF1A_METVA, AF024540J, SSU9Q353J,
MRSP_STAAU and P_R97680.

EXAMPLE 133: Isolation of cDNA clones Encoding Human PRO1270 

Use of the signal sequence algorithm described in Example 3 above allowed identification of a single
EST cluster sequence from the Incyte database. This EST cluster sequence was then compared to a variety of
expressed sequence tag (EST) databases which included public EST databases (e.g., GenBank) and a proprietary
EST DNA database (LIFESEQ®, Incyte Pharmaceuticals, Palo Alto, CA) to identify existing homologies. The
homology search was performed using the computer program BLAST or BLAST2 (Altschul et al., Methods in
Enzymology 266:460-480 (1996)). Those comparisons resulting in a BLAST score of 70 (or in some cases 90)
or greater that did not encode known proteins were clustered and assembled into a consensus DNA sequence with
the program "phrap" (Phil Green, University of Washington, Seattle, Washington). The consensus sequence
obtained therefrom is herein designated DNA57951.

In light of an observed sequence homology between the DNA57951 consensus sequence and an EST
sequence encompassed within the Merck EST clone no. 124878, the Merck EST clone 124878 was purchased
and the cDNA insert was obtained and sequenced. It was found that this insert encoded a full-length protein. 
The sequence of this cDNA insert is shown in Figure 295 and is herein designated as DNA66308-1537. 

Clone DNA66308-1537 contains a single open reading frame with an apparent translational initiation
site at nucleotide positions 103-105 and ending at the stop codon at nucleotide positions 1042-1044 (Figure 295).
The predicted polypeptide precursor is 313 amino acids long (Figure 296). The full-length PRO1270 protein
shown in Figure 296 has an estimated molecular weight of about 34,978 daltons and a pI of about 5.71. Analysis
of the full-length PRO1270 sequence shown in Figure 296 (SEQ ID NO:414) evidences the presence of the
following: a signal peptide from about amino acid 1 to about amino acid 16, a potential N-glycosylation site from
about amino acid 163 to about amino acid 166 and glycosaminoglycan attachment sites from about amino acid
74 to about amino acid 77 and from about amino acid 289 to about amino acid 292. Clone DNA66308-1537 has
been deposited with ATCC on August 25, 1998 and is assigned ATCC deposit no. 203159.

An analysis of the Dayhoff database (version 35.45 SwissProt 35), using a WU-BLAST2 sequence
alignment analysis of the full-length sequence shown in Figure 296 (SEQ ID NO:414), evidenced significant
homology between the PRO1270 amino acid sequence and the following Dayhoff sequences: XLU86699J,
S49589, FIBA_PARPA, FIBB_HUMAN, P_R47189, ARX)4326J, DRTENASCN_1, ARW
and FIBG_BOVIN.

EXAMPLE 134: Isolation of cDNA clones Encoding Human PRO1271

Use of the signal sequence algorithm described in Example 3 above allowed identification of a single 
EST cluster sequence from the Incyte database. This EST cluster sequence was then compared to a variety of
expressed sequence tag (EST) databases which included public EST databases (e.g., GenBank) and a proprietary
EST DNA database (LIFESEQ®, Incyte Pharmaceuticals, Palo Alto, CA) to identify existing homologies. The
homology search was performed using the computer program BLAST or BLAST2 (Altschul et al., Methods in
Enzymology 266:460-480 (1996)). Those comparisons resulting in a BLAST score of 70 (or in some cases 90)
or greater that did not encode known proteins were clustered and assembled into a consensus DNA sequence with
the program "phrap" (Phil Green, University of Washington, Seattle, Washington). The consensus sequence
obtained therefrom is herein designated DNA57955.

In light of an observed sequence homology between the DNA57955 consensus sequence and an EST
sequence encompassed within the Merck EST clone no. AA625350, the Merck EST clone AA625350 was
purchased and the cDNA insert was obtained and sequenced. It was found that this insert encoded a full-length
protein. The sequence of this cDNA insert is shown in Figure 297 and is herein designated as DNA66309-1538.

Clone DNA66309-1538 contains a single open reading frame with an apparent translational initiation
site at nucleotide positions 94-96 and ending at the stop codon at nucleotide positions 718-720 (Figure 297). The
predicted polypeptide precursor is 208 amino acids long (Figure 298). The full-length PRO1271 protein shown
in Figure 298 has an estimated molecular weight of about 21,531 daltons and a pI of about 8.99. Analysis of
the full-length PRO1271 sequence shown in Figure 298 (SEQ ID NO:416) evidences the presence of the
following: a signal peptide from about amino acid 1 to about amino acid 31 and a transmembrane domain from
about amino acid 166 to about amino acid 187. Clone DNA66309-1538 has been deposited with ATCC on
September 15, 1998 and is assigned ATCC deposit no. 203235.

An analysis of the Dayhoff database (version 35.45 SwissProt 35), using a WU-BLAST2 sequence
alignment analysis of the full-length sequence shown in Figure 298 (SEQ ID NO:416), evidenced significant
homology between the PRO1271 amino acid sequence and the following Dayhoff sequences: S57180, S63257,
AGA1_YEAST, BPU43599J, YS8A_CAEEL, S67570, LSU54556_2, S70305, VGLX_HSVEB, and
D88733J.

EXAMPLE 135: Isolation of cDNA clones Encoding Human PRO1375

A Merck/Wash. U. database was searched and a Merck EST was identified. This sequence was then
put in a program which aligns it with other sequences from the Swiss-Prot public database, public EST
databases (e.g., GenBank, Merck/Wash. U.), and a proprietary EST database (LIFESEQ®, Incyte
Pharmaceuticals, Palo Alto, CA). The search was performed using the computer program BLAST or BLAST2
[Altschul et al., Methods in Enzymology, 266:460-480 (1996)] as a comparison of the extracellular domain
(ECD) protein sequences to a 6 frame translation of the EST sequences. Those comparisons resulting in a
BLAST score of 70 (or in some cases, 90) or greater that did not encode known proteins were clustered and
assembled into consensus DNA sequences with the program "phrap" (Phil Green, University of Washington,
Seattle, Washington).
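
The "6 frame translation" mentioned above means translating a nucleotide sequence in all three reading frames on each strand, so that protein queries can be compared against every possible encoded peptide. The following is a minimal sketch using the standard genetic code; the function names and the short test sequence are illustrative, and this is not the search program referred to in the text.

```python
BASES = "TCAG"
# Standard genetic code, codons ordered T, C, A, G at each position.
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    a + b + c: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}
COMPLEMENT = str.maketrans("ACGT", "TGCA")


def translate(dna):
    """Translate codon by codon from position 0; unknown codons become 'X'."""
    dna = dna.upper()
    return "".join(CODON_TABLE.get(dna[i:i + 3], "X")
                   for i in range(0, len(dna) - 2, 3))


def six_frame_translation(dna):
    """Map frame labels (+1..+3 forward, -1..-3 reverse strand) to peptides."""
    forward = dna.upper()
    reverse = forward.translate(COMPLEMENT)[::-1]
    frames = {}
    for offset in range(3):
        frames[f"+{offset + 1}"] = translate(forward[offset:])
        frames[f"-{offset + 1}"] = translate(reverse[offset:])
    return frames


# Hypothetical toy sequence: ATG GCC TAA reads as M, A, stop in frame +1.
print(six_frame_translation("ATGGCCTAA")["+1"])  # MA*
```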

A consensus DNA sequence was assembled relative to other EST sequences using phrap. This 
consensus sequence is designated herein "DNA67003". 

Based on the DNA67003 consensus sequence, the nucleic acid (SEQ ID NO:417) was identified in a
human pancreas library. DNA sequencing of the clone gave the full-length DNA sequence for PRO1375 and
the derived protein sequence for PRO1375.

The entire coding sequence of PRO1375 is shown in Figure 299 (SEQ ID NO:417). Clone DNA67004-1614
contains a single open reading frame with an apparent translational initiation site at nucleotide positions
104-106 and an apparent stop codon at nucleotide positions 698-700 of SEQ ID NO:417. The predicted
polypeptide precursor is 198 amino acids long. The transmembrane domains are at about amino acids 11-28
(type II) and 103-125 of SEQ ID NO:418. Clone DNA67004-1614 has been deposited with ATCC and is
assigned ATCC deposit no. 203115. The full-length PRO1375 protein shown in Figure 300 has an estimated
molecular weight of about 22,531 daltons and a pI of about 8.47.

An analysis of the Dayhoff database (version 35.45 SwissProt 35), using a WU-BLAST2 sequence 
alignment analysis of the full-length sequence shown in Figure 300 (SEQ ID NO:418), revealed sequence identity
between the PRO1375 amino acid sequence and the following Dayhoff sequences: AF026198_5, CELR12C12_5,
S73465, Y011_MYCPN, S64538J, P_P8150, MUVSHPO10J, VSHMUMPL and CVU59751_5.



EXAMPLE 136: Isolation of cDNA clones Encoding Human PRO1385

Use of the signal sequence algorithm described in Example 3 above allowed identification of a single
EST cluster sequence from the Incyte database. This EST cluster sequence was then compared to a variety of
expressed sequence tag (EST) databases which included public EST databases (e.g., GenBank) and a proprietary
EST DNA database (LIFESEQ®, Incyte Pharmaceuticals, Palo Alto, CA) to identify existing homologies. The
homology search was performed using the computer program BLAST or BLAST2 (Altschul et al., Methods in
Enzymology 266:460-480 (1996)). Those comparisons resulting in a BLAST score of 70 (or in some cases 90)
or greater that did not encode known proteins were clustered and assembled into a consensus DNA sequence with
the program "phrap" (Phil Green, University of Washington, Seattle, Washington). The consensus sequence
obtained therefrom is herein designated DNA57952.
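
The selection rule described above (keep comparisons scoring at or above a threshold that do not encode known proteins) can be sketched as a simple filter. This is an illustration only: the record type, field names and example identifiers are hypothetical, and real BLAST output would first need to be parsed into such records.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class Hit:
    est_id: str           # hypothetical identifier of the matched EST
    score: float          # BLAST score of the comparison
    known_protein: bool   # True if the match encodes a known protein


def select_for_assembly(hits, min_score=70.0):
    """Keep hits at or above min_score that do not encode known proteins."""
    return [h for h in hits if h.score >= min_score and not h.known_protein]


if __name__ == "__main__":
    hits = [
        Hit("EST_A", 95.0, False),
        Hit("EST_B", 72.0, True),    # excluded: known protein
        Hit("EST_C", 65.0, False),   # excluded: below threshold
    ]
    print([h.est_id for h in select_for_assembly(hits)])
    print([h.est_id for h in select_for_assembly(hits, min_score=90.0)])
```

The stricter threshold of 90 mentioned in the text corresponds to passing `min_score=90.0`.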

In light of an observed sequence homology between the DNA57952 consensus sequence and an EST
sequence encompassed within the Incyte EST clone no. 3129630, the Incyte EST clone 3129630 was purchased
and the cDNA insert was obtained and sequenced. It was found that this insert encoded a full-length protein.
The sequence of this cDNA insert is shown in Figure 301 and is herein designated as DNA68869-1610.

Clone DNA68869-1610 contains a single open reading frame with an apparent translational initiation
site at nucleotide positions 26-28 [remainder of sentence illegible]. The
predicted polypeptide precursor is 128 amino acids long (Figure 302). The full-length PRO1385 protein shown
in Figure 302 has an estimated molecular weight of about 13,663 daltons and a pI of about 10.97. Analysis of
the full-length PRO1385 sequence shown in Figure 302 (SEQ ID NO:420) evidences the presence of the
following: a signal peptide from about amino acid 1 to about amino acid 28, and glycosaminoglycan attachment
sites from about amino acid 82 to about amino acid 85 and from about amino acid 91 to about amino acid 94.
Clone DNA68869-1610 has been deposited with ATCC on August 25, 1998 and is assigned ATCC deposit no.
203164.

An analysis of the Dayhoff database (version 35.45 SwissProt 35), using a WU-BLAST2 sequence
alignment analysis of the full-length sequence shown in Figure 302 (SEQ ID NO:420), evidenced low homology
between the PRO1385 amino acid sequence and the following Dayhoff sequences: CELT14A8_1,
LMNACHRA1J, HXD9_HUMAN, CHKCMLF_1, HS5PP34_2, DMDRING_1, A37107_1,
MMLUNGENE_1, PUM_DROME and DMU25117J.

EXAMPLE 137: Isolation of cDNA Clones Encoding Human PRO1387

Use of the signal sequence algorithm described in Example 3 above allowed identification of a single
EST cluster sequence from the Incyte database. This EST cluster sequence was then compared to a variety of
expressed sequence tag (EST) databases which included public EST databases (e.g., GenBank) and a proprietary
EST DNA database (LIFESEQ®, Incyte Pharmaceuticals, Palo Alto, CA) to identify existing homologies. The
homology search was performed using the computer program BLAST or BLAST2 (Altschul et al., Methods in
Enzymology 266:460-480 (1996)). Those comparisons resulting in a BLAST score of 70 (or in some cases 90)
or greater that did not encode known proteins were clustered and assembled into a consensus DNA sequence with
the program "phrap" (Phil Green, University of Washington, Seattle, Washington). The consensus sequence
obtained therefrom is herein designated DNA56259.

In light of an observed sequence homology between the DNA56259 consensus sequence and an EST
sequence encompassed within the Incyte EST clone no. 3507924, the Incyte EST clone 3507924 was purchased
and the cDNA insert was obtained and sequenced. It was found that this insert encoded a full-length protein.
The sequence of this cDNA insert is shown in Figure 303 and is herein designated as DNA68872-1620.

Clone DNA68872-1620 contains a single open reading frame with an apparent translational initiation
site at nucleotide positions 85-87 and ending at the stop codon at nucleotide positions 1267-1269 (Figure 303).
The predicted polypeptide precursor is 394 amino acids long (Figure 304). The full-length PRO1387 protein
shown in Figure 304 has an estimated [molecular weight and pI illegible]. Analysis
of the full-length PRO1387 sequence shown in Figure 304 (SEQ ID NO:422) evidences the presence of the
following: a signal peptide from about amino acid 1 to about amino acid 19, a transmembrane domain from about
amino acid 275 to about amino acid [illegible], [text illegible] to about amino
acid 79, from about amino acid 231 to about amino acid 234, from about amino acid 302 to about amino acid
305, from about amino acid 307 to about amino acid 310 and from about amino acid 376 to about amino acid
379, and amino acid sequence blocks having homology to myelin P0 protein from about amino acid 210 to about
amino acid 239 and from about amino acid 92 to about amino acid 121. Clone DNA68872-1620 has been
deposited with ATCC on August 25, 1998 and is assigned ATCC deposit no. 203160.

An analysis of the Dayhoff database (version 35.45 SwissProt 35), using a WU-BLAST2 sequence
alignment analysis of the full-length sequence shown in Figure 304 (SEQ ID NO:422), evidenced significant
homology between the PRO1387 amino acid sequence and the following Dayhoff sequences: P_W36955,
MYP0_HETFR, HS46KDAJ, AF049498J, MYP0_HUMAN, AF030454J, A53268, SHFTCRAJ,
P_W14146 and GEN12838.

EXAMPLE 138: Isolation of cDNA Clones Encoding Human PRO1384

A consensus DNA sequence was assembled relative to other EST sequences using phrap as described
in Example 1 above. This consensus sequence is herein designated DNA54192. Based on the DNA54192
sequence, oligonucleotides were synthesized: 1) to identify by PCR a cDNA library that contained the sequence
of interest, and 2) for use as probes to isolate a clone of the full-length coding sequence for PRO1384.
PCR primers (forward and reverse) were synthesized:
forward PCR primer 5'-TGCAGCCCCTGTGACACAAACTGG-3' (SEQ ID NO:425)
reverse PCR primer 5'-CTGAGATAACCGAGCCATCCTCCCAC-3' (SEQ ID NO:426)
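
Basic properties of the two primers listed above (length, GC content, and the reverse complement, i.e. the opposite strand read 5' to 3') can be checked with a few lines of code. The sketch below is illustrative; the helper names are hypothetical and the sequences are the two primers as printed, with extraction spaces removed.

```python
# Complement table for unambiguous DNA bases.
_COMPLEMENT = str.maketrans("ACGT", "TGCA")

FORWARD_PRIMER = "TGCAGCCCCTGTGACACAAACTGG"     # SEQ ID NO:425
REVERSE_PRIMER = "CTGAGATAACCGAGCCATCCTCCCAC"   # SEQ ID NO:426


def reverse_complement(seq: str) -> str:
    """Return the reverse complement of a DNA sequence (A, C, G, T only)."""
    seq = seq.upper()
    if set(seq) - set("ACGT"):
        raise ValueError("sequence contains characters other than A, C, G, T")
    return seq.translate(_COMPLEMENT)[::-1]


def gc_fraction(seq: str) -> float:
    """Return the fraction of G and C bases in the sequence."""
    seq = seq.upper()
    return (seq.count("G") + seq.count("C")) / len(seq)


if __name__ == "__main__":
    for name, primer in (("forward", FORWARD_PRIMER), ("reverse", REVERSE_PRIMER)):
        print(name, len(primer), f"GC={gc_fraction(primer):.2f}",
              reverse_complement(primer))
```

Such a check is a convenient way to catch transcription errors when a primer sequence is copied from a printed specification.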

Additionally, a synthetic oligonucleotide hybridization probe was constructed from the DNA54192
sequence which had the following nucleotide sequence:

hybridization probe
5'-[illegible]AGATAGCTGCTATGGGTTCTTCAGGCACAACTTAACATGGGAAG-3' (SEQ ID NO:427)

In order to screen several libraries for a source of a full-length clone, DNA from the libraries was
screened by PCR amplification with the PCR primer pair identified above. A positive library was then used to
isolate clones encoding the PRO1384 gene using the probe oligonucleotide and one of the PCR primers. RNA
for construction of the cDNA libraries was isolated from human fetal liver.

DNA sequencing of the clones isolated as described above gave the full-length DNA sequence for
PRO1384 (designated herein as DNA71159-1617 [Figure 305, SEQ ID NO:423]) and the derived protein
sequence for PRO1384.

The entire coding sequence of PRO1384 is shown in Figure 305 (SEQ ID NO:423). Clone DNA71159-1617
contains a single open reading frame with an apparent translational initiation site at nucleotide positions
182-184 and an apparent stop codon at nucleotide positions 869-871. The predicted polypeptide precursor is 229
amino acids long. The full-length PRO1384 protein shown in Figure 306 has an estimated molecular weight of
about 26,650 daltons and a pI of about 8.76. Additional features include a type II transmembrane domain at
about amino acids 32-57, and potential N-glycosylation sites at about amino acids 68-71, 120-123, and 134-137.

An analysis of the Dayhoff database (version 35.45 SwissProt 35), using a WU-BLAST2 sequence
alignment analysis of the full-length sequence shown in Figure 306 (SEQ ID NO:424), revealed homology
between the PRO1384 amino acid sequence and the following Dayhoff sequences: AF054819J, HSAJ1687_1,
AF009511_1, AB010710_1, GEN13595, HSAJ673_1, GEN






and NK13_RAT.

Clone DNA71159-1617 has been deposited with ATCC and is assigned ATCC deposit no. 203135.

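EXAMPLE 139: Use of PRO as a Hybridization Probe

The following method describes use of a nucleotide sequence encoding PRO as a hybridization probe.

DNA comprising the coding sequence of full-length or mature PRO as disclosed herein is employed as
a probe to screen for homologous DNAs (such as those encoding naturally-occurring variants of PRO) in human
tissue cDNA libraries or human tissue genomic libraries.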

Hybridization and washing of filters containing either library DNAs is performed under the following
high stringency conditions. Hybridization of radiolabeled PRO-derived probe to the filters is performed in a
solution of 50% formamide, 5x SSC, 0.1% SDS, 0.1% sodium pyrophosphate, 50 mM sodium phosphate, pH
6.8, 2x Denhardt's solution, and 10% dextran sulfate at 42°C for 20 hours. Washing of the filters is performed
in an aqueous solution of 0.1x SSC and 0.1% SDS at 42°C.
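
The wash solution above (0.1x SSC, 0.1% SDS) is normally prepared by diluting concentrated stocks, using C1·V1 = C2·V2. The sketch below is an illustration under stated assumptions: the 20x SSC and 10% SDS stock strengths are common laboratory conventions assumed here, not values given in this text, and the function names are hypothetical.

```python
def stock_volume_ml(final_conc: float, stock_conc: float, final_volume_ml: float) -> float:
    """Volume of stock needed so that C_stock * V_stock = C_final * V_final."""
    if stock_conc <= 0 or final_conc < 0 or final_conc > stock_conc:
        raise ValueError("need 0 <= final_conc <= stock_conc and stock_conc > 0")
    return final_conc * final_volume_ml / stock_conc


def wash_solution_recipe(final_volume_ml: float,
                         ssc_stock_x: float = 20.0,      # assumed 20x SSC stock
                         sds_stock_pct: float = 10.0):   # assumed 10% SDS stock
    """Volumes (mL) of each stock and of water for 0.1x SSC / 0.1% SDS."""
    ssc = stock_volume_ml(0.1, ssc_stock_x, final_volume_ml)
    sds = stock_volume_ml(0.1, sds_stock_pct, final_volume_ml)
    return {"ssc_stock_ml": ssc, "sds_stock_ml": sds,
            "water_ml": final_volume_ml - ssc - sds}


if __name__ == "__main__":
    print(wash_solution_recipe(1000.0))
```

With the assumed stocks, one litre of wash solution takes about 5 mL of SSC stock and 10 mL of SDS stock, with water making up the balance.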

DNAs having a desired sequence identity with the DNA encoding full-length native sequence PRO can 

then be identified using standard techniques known in the art. 


EXAMPLE 140: Expression of PRO in E. coli

This example illustrates preparation of an unglycosylated form of PRO by recombinant expression in
E. coli.

The DNA sequence encoding PRO is initially amplified using selected PCR primers. The primers
should contain restriction enzyme sites which correspond to the restriction enzyme sites on the selected
expression vector. A variety of expression vectors may be employed. An example of a suitable vector is
pBR322 (derived from E. coli; see Bolivar et al., Gene, 2:95 (1977)) which contains genes for ampicillin and
tetracycline resistance. The vector is digested with restriction enzyme and dephosphorylated. The PCR
amplified sequences are then ligated into the vector. The vector will preferably include sequences which encode
for an antibiotic resistance gene, a trp promoter, a polyhis leader (including the first six STII codons, polyhis
sequence, and enterokinase cleavage site), the PRO coding region, lambda transcriptional terminator, and an
argU gene.

The ligation mixture is then used to transform a selected E. coli strain using the methods described in
Sambrook et al., supra. Transformants are identified by their ability to grow on LB plates and antibiotic resistant
colonies are then selected. Plasmid DNA can be isolated and confirmed by restriction analysis and DNA
sequencing.

Selected clones can be grown overnight in liquid culture medium such as LB broth supplemented with
antibiotics. The overnight culture may subsequently be used to inoculate a larger scale culture. The cells are
then grown to a desired optical density, during which the expression promoter is turned on.

After culturing the cells for several more hours, the cells can be harvested by centrifugation. The cell
pellet obtained by the centrifugation can be solubilized using various agents known in the art, and the solubilized
PRO protein can then be purified using a metal chelating column under conditions that allow tight binding of the
protein.

PRO may be expressed in E. coli in a poly-His tagged form, using the following procedure. The DNA
encoding PRO is initially amplified using selected PCR primers. The primers will contain restriction enzyme
sites which correspond to the restriction enzyme sites on the selected expression vector, and other useful
sequences providing for efficient and reliable translation initiation, rapid purification on a metal chelation
column, and proteolytic removal with enterokinase. The PCR-amplified, poly-His tagged sequences are then
ligated into an expression vector, which is used to transform an E. coli host based on strain 52 (W3110
fuhA(tonA) lon galE rpoHts(htpRts) clpP(lacIq)). Transformants are first grown in LB containing 50 mg/ml
carbenicillin at 30°C with shaking until an O.D.600 of 3-5 is reached. Cultures are then diluted 50-100 fold into
CRAP media (prepared by mixing 3.57 g (NH4)2SO4, 0.71 g sodium citrate·2H2O, 1.07 g KCl, 5.36 g Difco
yeast extract, 5.36 g Sheffield hycase SF in 500 mL water, as well as 110 mM MOPS, pH 7.3, 0.55% (w/v)
glucose and 7 mM MgSO4) and grown for approximately 20-30 hours at 30°C with shaking. Samples are
removed to verify expression by SDS-PAGE analysis, and the bulk culture is centrifuged to pellet the cells. Cell
pellets are frozen until purification and refolding.

E. coli paste from 0.5 to 1 L fermentations (6-10 g pellets) is resuspended in 10 volumes (w/v) in 7 M
guanidine, 20 mM Tris, pH 8 buffer. Solid sodium sulfite and sodium tetrathionate are added to make final
concentrations of 0.1 M and 0.02 M, respectively, and the solution is stirred overnight at 4°C. This step results
in a denatured protein with all cysteine residues blocked by sulfitolization. The solution is centrifuged at 40,000
rpm in a Beckman Ultracentrifuge for 30 min. The supernatant is diluted with 3-5 volumes of metal chelate
column buffer (6 M guanidine, 20 mM Tris, pH 7.4) and filtered through 0.22 micron filters to clarify. The
clarified extract is loaded onto a 5 ml Qiagen Ni-NTA metal chelate column equilibrated in the metal chelate
column buffer. The column is washed with additional buffer containing 50 mM imidazole (Calbiochem, Utrol
grade), pH 7.4. The protein is eluted with buffer containing 250 mM imidazole. Fractions containing the
desired protein are pooled and stored at 4°C. Protein concentration is estimated by its absorbance at 280 nm
using the calculated extinction coefficient based on its amino acid sequence.
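
One common way to calculate an extinction coefficient from an amino acid sequence sums per-residue contributions from tryptophan, tyrosine and disulfide-bonded cystine, and then applies the Beer-Lambert law. The sketch below is an assumption about how such an estimate might be made, not the procedure used in this example: the coefficients (5500, 1490 and 125 per M per cm) are widely used literature values, the function names are hypothetical, and the number of disulfides is left to the caller because the cysteines are chemically blocked at this stage.

```python
# Widely used per-residue molar absorbances at 280 nm (M^-1 cm^-1); assumed here.
EPS_TRP = 5500
EPS_TYR = 1490
EPS_CYSTINE = 125


def molar_extinction_280(sequence: str, disulfides: int = 0) -> int:
    """Estimate the molar extinction coefficient at 280 nm from a sequence."""
    seq = sequence.upper()
    return (EPS_TRP * seq.count("W")
            + EPS_TYR * seq.count("Y")
            + EPS_CYSTINE * disulfides)


def concentration_mg_per_ml(a280: float, epsilon: float, mw_da: float,
                            path_cm: float = 1.0) -> float:
    """Beer-Lambert: molar concentration = A / (epsilon * l); times MW gives mg/mL."""
    if epsilon <= 0 or path_cm <= 0:
        raise ValueError("epsilon and path length must be positive")
    return a280 / (epsilon * path_cm) * mw_da


if __name__ == "__main__":
    eps = molar_extinction_280("MWYYC")   # toy sequence, illustration only
    print(eps, concentration_mg_per_ml(0.848, eps, 10_000.0))
```

A protein with no tryptophan or tyrosine gives an estimate of zero, in which case absorbance at 280 nm is not a usable measure of concentration.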

The proteins are refolded by diluting the sample slowly into freshly prepared refolding buffer consisting
of: 20 mM Tris, pH 8.6, 0.3 M NaCl, 2.5 M urea, 5 mM cysteine, 20 mM glycine and 1 mM EDTA.
Refolding volumes are chosen so that the final protein concentration is between 50 and 100 micrograms/ml. The
refolding solution is stirred gently at 4°C for 12-36 hours. The refolding reaction is quenched by the addition
of TFA to a final concentration of 0.4% (pH of approximately 3). Before further purification of the protein, the
solution is filtered through a 0.22 micron filter and acetonitrile is added to 2-10% final concentration. The
refolded protein is chromatographed on a Poros R1/H reversed phase column using a mobile buffer of 0.1%
TFA with elution with a gradient of acetonitrile from 10 to 80%. Aliquots of fractions with A280 absorbance
are analyzed on SDS polyacrylamide gels and fractions containing homogeneous refolded protein are pooled.
Generally, the properly refolded species of most proteins are eluted at the lowest concentrations of acetonitrile
since those species are the most compact with their hydrophobic interiors shielded from interaction with the
reversed phase resin. Aggregated species are usually eluted at higher acetonitrile concentrations. In addition
to resolving misfolded forms of proteins from the desired form, the reversed phase step also removes endotoxin
from the samples.
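
Choosing a refolding volume for a target final protein concentration of 50 to 100 micrograms/ml is a single division. The sketch below is illustrative; the function names and the 10 mg example amount are hypothetical, and the small volume contributed by the protein sample itself is ignored.

```python
def refolding_volume_ml(protein_mg: float, target_ug_per_ml: float) -> float:
    """Final volume (mL) that gives the target protein concentration."""
    if protein_mg <= 0 or target_ug_per_ml <= 0:
        raise ValueError("inputs must be positive")
    return protein_mg * 1000.0 / target_ug_per_ml


def refolding_volume_range_ml(protein_mg: float,
                              low_ug_per_ml: float = 50.0,
                              high_ug_per_ml: float = 100.0) -> tuple[float, float]:
    """(minimum, maximum) final volume for the 50-100 micrograms/ml window."""
    return (refolding_volume_ml(protein_mg, high_ug_per_ml),
            refolding_volume_ml(protein_mg, low_ug_per_ml))


if __name__ == "__main__":
    # Hypothetical example: 10 mg of pooled, purified protein.
    print(refolding_volume_range_ml(10.0))  # (100.0, 200.0)
```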

Fractions containing the desired folded PRO polypeptide are pooled and the acetonitrile removed using
a gentle stream of nitrogen directed at the solution. Proteins are formulated into 20 mM Hepes, pH 6.8 with
0.14 M sodium chloride and 4% mannitol by dialysis or by gel filtration using G25 Superfine (Pharmacia) resins
equilibrated in the formulation buffer and sterile filtered.

Many of the PRO polypeptides disclosed herein were successfully expressed as described above.

EXAMPLE 141: Expression of PRO in Mammalian Cells

This example illustrates preparation of a potentially glycosylated form of PRO by recombinant
expression in mammalian cells.

The vector, pRK5 (see EP 307,247, published March 15, 1989), is employed as the expression vector.
Optionally, the PRO DNA is ligated into pRK5 with selected restriction enzymes to allow insertion of the PRO
DNA using ligation methods such as described in Sambrook et al., supra. The resulting vector is called
pRK5-PRO.

In one embodiment, the selected host cells may be 293 cells. Human 293 cells (ATCC CCL 1573) are
grown to confluence in tissue culture plates in medium such as DMEM supplemented with fetal calf serum and
optionally, nutrient components and/or antibiotics. About 10 µg pRK5-PRO DNA is mixed with about 1 µg
DNA encoding the VA RNA gene [Thimmappaya et al., Cell, 31:543 (1982)] and dissolved in 500 µl of 1 mM
Tris-HCl, 0.1 mM EDTA, 0.227 M CaCl2. To this mixture is added, dropwise, 500 µl of 50 mM HEPES (pH
7.35), 280 mM NaCl, 1.5 mM NaPO4, and a precipitate is allowed to form for 10 minutes at 25°C. The
precipitate is suspended and added to the 293 cells and allowed to settle for about four hours at 37°C. The
culture medium is aspirated off and 2 ml of 20% glycerol in PBS is added for 30 seconds. The 293 cells are
then washed with serum free medium, fresh medium is added and the cells are incubated for about 5 days.

Approximately 24 hours after the transfections, the culture medium is removed and replaced with culture
medium (alone) or culture medium containing 200 µCi/ml 35S-cysteine and 200 µCi/ml 35S-methionine. After
a 12 hour incubation, the conditioned medium is collected, concentrated on a spin filter, and loaded onto a 15%
SDS gel. The processed gel may be dried and exposed to film for a selected period of time to reveal the
presence of PRO polypeptide. The cultures containing transfected cells may undergo further incubation (in
serum free medium) and the medium is tested in selected bioassays.

In an alternative technique, PRO may be introduced into 293 cells transiently using the dextran sulfate
method described by Somparyrac et al., Proc. Natl. Acad. Sci., 12:7575 (1981). 293 cells are grown to
maximal density in a spinner flask and 700 µg pRK5-PRO DNA is added. The cells are first concentrated from
the spinner flask by centrifugation and washed with PBS. The DNA-dextran precipitate is incubated on the cell
pellet for four hours. The cells are treated with 20% glycerol for 90 seconds, washed with tissue culture
medium, and reintroduced into the spinner flask containing tissue culture medium, 5 µg/ml bovine insulin and
0.1 µg/ml bovine transferrin. After about four days, the conditioned media is centrifuged and filtered to remove
cells and debris. The sample containing expressed PRO can then be concentrated and purified by any selected
method, such as dialysis and/or column chromatography.

In another embodiment, PRO can be expressed in CHO cells. The pRK5-PRO can be transfected into
CHO cells using known reagents such as CaPO4 or DEAE-dextran. As described above, the cell cultures can
be incubated, and the medium replaced with culture medium (alone) or medium containing a radiolabel such as
35S-methionine. After determining the presence of PRO polypeptide, the culture medium may be replaced with
serum free medium. Preferably, the cultures are incubated for about 6 days, and then the conditioned medium
is harvested. The medium containing the expressed PRO can then be concentrated and purified by any selected
method.

Epitope-tagged PRO may also be expressed in host CHO cells. The PRO may be subcloned out of the
pRK5 vector. The subclone insert can undergo PCR to fuse in frame with a selected epitope tag such as a
poly-his tag into a Baculovirus expression vector. The poly-his tagged PRO insert can then be subcloned into a SV40
driven vector containing a selection marker such as DHFR for selection of stable clones. Finally, the CHO cells
can be transfected (as described above) with the SV40 driven vector. Labeling may be performed, as described
above, to verify expression. The culture medium containing the expressed poly-His tagged PRO can then be
concentrated and purified by any selected method, such as by Ni2+-chelate affinity chromatography.

PRO may also be expressed in CHO and/or COS cells by a transient expression procedure or in CHO
cells by another stable expression procedure.

Stable expression in CHO cells is performed using the following procedure. The proteins are expressed
as an IgG construct (immunoadhesin), in which the coding sequences for the soluble forms (e.g., extracellular
domains) of the respective proteins are fused to an IgG1 constant region sequence containing the hinge, CH2 and
CH3 domains, and/or as a poly-His tagged form.

Following PCR amplification, the respective DNAs are subcloned in a CHO expression vector using
standard techniques as described in Ausubel et al., Current Protocols of Molecular Biology, Unit 3.16, John
Wiley and Sons (1997). CHO expression vectors are constructed to have compatible restriction sites 5' and 3'
of the DNA of interest to allow the convenient shuttling of cDNAs. The vector used for expression in CHO cells
is as described in Lucas et al., Nucl. Acids Res. 24:9 (1774-1779) (1996), and uses the SV40 early
promoter/enhancer to drive expression of the cDNA of interest and dihydrofolate reductase (DHFR). DHFR
expression permits selection for stable maintenance of the plasmid following transfection.

Twelve micrograms of the desired plasmid DNA is introduced into approximately 10 million CHO cells
using commercially available transfection reagents Superfect® (Qiagen), Dosper® or Fugene® (Boehringer
Mannheim). The cells are grown as described in Lucas et al., supra. Approximately 3 x 10^7 cells are frozen
in an ampule for further growth and production as described below.

The ampules containing the plasmid DNA are thawed by placement into a water bath and mixed by
vortexing. The contents are pipetted into a centrifuge tube containing 10 mL of media and centrifuged at 1000
rpm for 5 minutes. The supernatant is aspirated and the cells are resuspended in 10 mL of selective media (0.2
µm filtered PS20 with 5% 0.2 µm diafiltered fetal bovine serum). The cells are then aliquoted into a 100 mL
spinner containing 90 mL of selective media. After 1-2 days, the cells are transferred into a 250 mL spinner
filled with 150 mL selective growth medium and incubated at 37°C. After another 2-3 days, 250 mL, 500 mL
and 2000 mL spinners are seeded with 3 x 10^5 cells/mL. The cell media is exchanged with fresh media by
centrifugation and resuspension in production medium. Although any suitable CHO media may be employed,
a production medium described in U.S. Patent No. 5,122,469, issued June 16, 1992 may actually be used. A
3L production spinner is seeded at 1.2 x 10^6 cells/mL. On day 0, the cell number and pH are determined. On day
1, the spinner is sampled and sparging with filtered air is commenced. On day 2, the spinner is sampled, the
temperature shifted to 33°C, and 30 mL of 500 g/L glucose and 0.6 mL of 10% antifoam (e.g., 35%
polydimethylsiloxane emulsion, Dow Corning 365 Medical Grade Emulsion) added. Throughout the production,
the pH is adjusted as necessary to keep it at around 7.2. After 10 days, or until the viability dropped below
70%, the cell culture is harvested by centrifugation and filtering through a 0.22 µm filter. The filtrate was either
stored at 4°C or immediately loaded onto columns for purification.
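
The day 2 glucose addition can be put in perspective with a short calculation: 30 mL of a 500 g/L solution delivers a fixed mass of glucose, which is then diluted in the culture volume. The sketch below is illustrative only; the function name is hypothetical, and it reports the increase in concentration, ignoring glucose already present or consumed and any volume removed by sampling.

```python
def glucose_feed(culture_l: float, feed_ml: float, feed_g_per_l: float) -> tuple[float, float]:
    """Return (grams of glucose added, resulting increase in g/L).

    The increase is computed over the combined volume of culture plus feed.
    """
    if culture_l <= 0 or feed_ml < 0 or feed_g_per_l < 0:
        raise ValueError("culture volume must be positive; feed values non-negative")
    feed_l = feed_ml / 1000.0
    grams = feed_l * feed_g_per_l
    return grams, grams / (culture_l + feed_l)


if __name__ == "__main__":
    grams, increase = glucose_feed(3.0, 30.0, 500.0)
    print(f"{grams:.1f} g added, about {increase:.2f} g/L increase")
```

For the 3 L spinner this is 15 g of glucose, an increase of a little under 5 g/L.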

For the poly-His tagged constructs, the proteins are purified using a Ni-NTA column (Qiagen). Before
purification, imidazole is added to the conditioned media to a concentration of 5 mM. The conditioned media
is pumped onto a 6 ml Ni-NTA column equilibrated in 20 mM Hepes, pH 7.4, buffer containing 0.3 M NaCl
and 5 mM imidazole at a flow rate of 4-5 ml/min at 4°C. After loading, the column is washed with additional
equilibration buffer and the protein eluted with equilibration buffer containing 0.25 M imidazole. The highly
purified protein is subsequently desalted into a storage buffer containing 10 mM Hepes, 0.14 M NaCl and 4%
mannitol, pH 6.8, with a 25 ml G25 Superfine (Pharmacia) column and stored at -80°C.

Immunoadhesin (Fc-containing) constructs are purified from the conditioned media as follows. The
conditioned medium is pumped onto a 5 ml Protein A column (Pharmacia) which had been equilibrated in 20
mM Na phosphate buffer, pH 6.8. After loading, the column is washed extensively with equilibration buffer
before elution with 100 mM citric acid, pH 3.5. The eluted protein is immediately neutralized by collecting 1
ml fractions into tubes containing 275 µL of 1 M Tris buffer, pH 9. The highly purified protein is subsequently
desalted into storage buffer as described above for the poly-His tagged proteins. The homogeneity is assessed
by SDS polyacrylamide gels and by N-terminal amino acid sequencing by Edman degradation.
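
The neutralization step mixes each 1 ml citric acid fraction with 275 µL of 1 M Tris, so both buffer species are diluted in the combined volume. A minimal sketch of that dilution arithmetic follows; the function name is hypothetical, volumes are assumed additive, and no attempt is made to predict the resulting pH.

```python
def neutralized_fraction(fraction_ml: float = 1.0,
                         tris_ml: float = 0.275,
                         citrate_mM: float = 100.0,
                         tris_stock_mM: float = 1000.0) -> dict:
    """Nominal concentrations after mixing one eluted fraction with Tris."""
    if fraction_ml <= 0 or tris_ml < 0:
        raise ValueError("fraction volume must be positive; Tris volume non-negative")
    total = fraction_ml + tris_ml  # assumes additive volumes
    return {
        "total_ml": total,
        "citrate_mM": citrate_mM * fraction_ml / total,
        "tris_mM": tris_stock_mM * tris_ml / total,
        "protein_dilution_factor": total / fraction_ml,
    }


if __name__ == "__main__":
    for key, value in neutralized_fraction().items():
        print(f"{key}: {value:.3f}")
```

The protein in each fraction is diluted about 1.3-fold by the neutralizing buffer, which matters when concentrations are estimated before desalting.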

Many of the PRO polypeptides disclosed herein were successfully expressed as described above. 



EXAMPLE 142: Expression of PRO in Yeast

The following method describes recombinant expression of PRO in yeast.

First, yeast expression vectors are constructed for intracellular production or secretion of PRO from
the ADH2/GAPDH promoter. DNA encoding PRO and the promoter is inserted into suitable restriction enzyme
sites in the selected plasmid to direct intracellular expression of PRO. For secretion, DNA encoding PRO can
be cloned into the selected plasmid, together with DNA encoding the ADH2/GAPDH promoter, a native PRO
signal peptide or other mammalian signal peptide, or, for example, a yeast alpha-factor or invertase secretory
signal/leader sequence, and linker sequences (if needed) for expression of PRO.

Yeast cells, such as yeast strain AB110, can then be transformed with the expression plasmids described
above and cultured in selected fermentation media. The transformed yeast supernatants can be analyzed by
precipitation with 10% trichloroacetic acid and separation by SDS-PAGE, followed by staining of the gels with
Coomassie Blue stain.

Recombinant PRO can subsequently be isolated and purified by removing the yeast cells from the
fermentation medium by centrifugation and then concentrating the medium using selected cartridge filters. The
concentrate containing PRO may further be purified using selected column chromatography resins.

Many of the PRO polypeptides disclosed herein were successfully expressed as described above. 

EXAMPLE 143: Expression of PRO in Baculovirus-Infected Insect Cells

The following method describes recombinant expression of PRO in Baculovirus-infected insect cells.

The sequence coding for PRO is fused upstream of an epitope tag contained within a baculovirus
expression vector. Such epitope tags include poly-his tags and immunoglobulin tags (like Fc regions of IgG).
A variety of plasmids may be employed, including plasmids derived from commercially available plasmids such
as pVL1393 (Novagen). Briefly, the sequence encoding PRO or the desired portion of the coding sequence of
PRO, such as the sequence encoding the extracellular domain of a transmembrane protein or the sequence
encoding the mature protein if the protein is extracellular, is amplified by PCR with primers complementary to
the 5' and 3' regions. The 5' primer may incorporate flanking (selected) restriction enzyme sites. The product
is then digested with those selected restriction enzymes and subcloned into the expression vector.

Recombinant baculovirus is generated by co-transfecting the above plasmid and BaculoGold™ virus
DNA (Pharmingen) into Spodoptera frugiperda ("Sf9") cells (ATCC CRL 1711) using lipofectin (commercially
available from GIBCO-BRL). After 4-5 days of incubation at 28°C, the released viruses are harvested and used
for further amplifications. Viral infection and protein expression are performed as described by O'Reilley et
al., Baculovirus expression vectors: A Laboratory Manual, Oxford: Oxford University Press (1994).

Expressed poly-his tagged PRO can then be purified, for example, by Ni2+-chelate affinity
chromatography as follows. Extracts are prepared from recombinant virus-infected Sf9 cells as described by
Rupert et al., Nature, 362:175-179 (1993). Briefly, Sf9 cells are washed, resuspended in sonication buffer (25
mL Hepes, pH 7.9; 12.5 mM MgCl2; 0.1 mM EDTA; 10% glycerol; 0.1% NP-40; 0.4 M KCl), and sonicated
twice for 20 seconds on ice. The sonicates are cleared by centrifugation, and the supernatant is diluted 50-fold
in loading buffer (50 mM phosphate, 300 mM NaCl, 10% glycerol, pH 7.8) and filtered through a 0.45 µm
filter. A Ni2+-NTA agarose column (commercially available from Qiagen) is prepared with a bed volume of 5
mL, washed with 25 mL of water and equilibrated with 25 mL of loading buffer. The filtered cell extract is
loaded onto the column at 0.5 mL per minute. The column is washed to baseline A280 with loading buffer, at
which point fraction collection is started. Next, the column is washed with a secondary wash buffer (50 mM
phosphate; 300 mM NaCl, 10% glycerol, pH 6.0), which elutes nonspecifically bound protein. After reaching
A280 baseline again, the column is developed with a 0 to 500 mM imidazole gradient in the secondary wash
buffer. One mL fractions are collected and analyzed by SDS-PAGE and silver staining or Western blot with
Ni2+-NTA-conjugated to alkaline phosphatase (Qiagen). Fractions containing the eluted His10-tagged PRO are
pooled and dialyzed against loading buffer.
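
When fractions from the 0 to 500 mM imidazole gradient are analyzed, it can help to know the nominal imidazole concentration of each 1 mL fraction. The sketch below assumes a linear gradient and ignores column dead volume and mixing; the total gradient volume (50 mL here) is a hypothetical value, since the text does not state it, and the function name is likewise illustrative.

```python
def fraction_imidazole_mM(fraction_index: int,
                          gradient_volume_ml: float,
                          fraction_ml: float = 1.0,
                          start_mM: float = 0.0,
                          end_mM: float = 500.0) -> float:
    """Nominal imidazole concentration at the midpoint of a fraction.

    fraction_index counts from 1 at the start of the linear gradient.
    """
    if fraction_index < 1 or gradient_volume_ml <= 0 or fraction_ml <= 0:
        raise ValueError("invalid fraction index or volume")
    midpoint_ml = (fraction_index - 0.5) * fraction_ml
    if midpoint_ml > gradient_volume_ml:
        raise ValueError("fraction lies beyond the end of the gradient")
    return start_mM + (end_mM - start_mM) * midpoint_ml / gradient_volume_ml


if __name__ == "__main__":
    # Hypothetical 50 mL gradient collected as 1 mL fractions.
    for i in (1, 25, 50):
        print(i, fraction_imidazole_mM(i, gradient_volume_ml=50.0))
```

In practice the imidazole reaching the collector lags the programmed gradient by the system dead volume, so these values are upper-bound estimates for early fractions.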

Alternatively, purification of the IgG tagged (or Fc tagged) PRO can be performed using known 
chromatography techniques, including for instance, Protein A or protein G column chromatography. 

Many of the PRO polypeptides disclosed herein were successfully expressed as described above. 

EXAMPLE 144: Preparation of Antibodies that Bind PRO

This example illustrates preparation of monoclonal antibodies which can specifically bind PRO.

Techniques for producing the monoclonal antibodies are known in the art and are described, for
instance, in Goding, supra. Immunogens that may be employed include purified PRO, fusion proteins containing
PRO, and cells expressing recombinant PRO on the cell surface. Selection of the immunogen can be made by
the skilled artisan without undue experimentation.

Mice, such as Balb/c, are immunized with the PRO immunogen emulsified in complete Freund's
adjuvant and injected subcutaneously or intraperitoneally in an amount from 1-100 micrograms. Alternatively,
the immunogen is emulsified in MPL-TDM adjuvant (Ribi Immunochemical Research, Hamilton, MT) and
injected into the animal's hind foot pads. The immunized mice are then boosted 10 to 12 days later with
additional immunogen emulsified in the selected adjuvant. Thereafter, for several weeks, the mice may also be
boosted with additional immunization injections. Serum samples may be periodically obtained from the mice
by retro-orbital bleeding for testing in ELISA assays to detect anti-PRO antibodies.

After a suitable antibody titer has been detected, the animals "positive" for antibodies can be injected
with a final intravenous injection of PRO. Three to four days later, the mice are sacrificed and the spleen cells
are harvested. The spleen cells are then fused (using 35% polyethylene glycol) to a selected murine myeloma
cell line such as P3X63AgU.1, available from ATCC, No. CRL 1597. The fusions generate hybridoma cells
which can then be plated in 96 well tissue culture plates containing HAT (hypoxanthine, aminopterin, and
thymidine) medium to inhibit proliferation of non-fused cells, myeloma hybrids, and spleen cell hybrids.

The hybridoma cells will be screened in an ELISA for reactivity against PRO. Determination of
"positive" hybridoma cells secreting the desired monoclonal antibodies against PRO is within the skill in the art.

The positive hybridoma cells can be injected intraperitoneally into syngeneic Balb/c mice to produce
ascites containing the anti-PRO monoclonal antibodies. Alternatively, the hybridoma cells can be grown in tissue
culture flasks or roller bottles. Purification of the monoclonal antibodies produced in the ascites can be
accomplished using ammonium sulfate precipitation, followed by gel exclusion chromatography. Alternatively,
affinity chromatography based upon binding of antibody to protein A or protein G can be employed.

EXAMPLE 145: Purification of PRO Polypeptides Using Specific Antibodies

Native or recombinant PRO polypeptides may be purified by a variety of standard techniques in the art
of protein purification. For example, pro-PRO polypeptide, mature PRO polypeptide, or pre-PRO polypeptide
is purified by immunoaffinity chromatography using antibodies specific for the PRO polypeptide of interest. In
general, an immunoaffinity column is constructed by covalently coupling the anti-PRO polypeptide antibody to
an activated chromatographic resin.

Polyclonal immunoglobulins are prepared from immune sera either by precipitation with ammonium 
sulfate or by purification on immobilized Protein A (Pharmacia LKB Biotechnology, Piscataway, N.J.). 
Likewise, monoclonal antibodies are prepared from mouse ascites fluid by ammonium sulfate precipitation or 
chromatography on immobilized Protein A. Partially purified immunoglobulin is covalently attached to a 
chromatographic resin such as CnBr-activated SEPHAROSE™ (Pharmacia LKB Biotechnology). The antibody 
is coupled to the resin, the resin is blocked, and the derivative resin is washed according to the manufacturer's 
instructions. 

Such an immunoaffinity column is utilized in the purification of PRO polypeptide by preparing a fraction 
from cells containing PRO polypeptide in a soluble form. This preparation is derived by solubilization of the 
whole cell or of a subcellular fraction obtained via differential centrifugation by the addition of detergent or by 
other methods well known in the art. Alternatively, soluble PRO polypeptide containing a signal sequence may 
be secreted in useful quantity into the medium in which the cells are grown. 

A soluble PRO polypeptide-containing preparation is passed over the immunoaffinity column, and the 
column is washed under conditions that allow the preferential absorbance of PRO polypeptide (e.g., high ionic 
strength buffers in the presence of detergent). Then, the column is eluted under conditions that disrupt 
antibody/PRO polypeptide binding (e.g., a low pH buffer such as approximately pH 2-3, or a high concentration 
of a chaotrope such as urea or thiocyanate ion), and PRO polypeptide is collected. 

EXAMPLE 146: Drug Screening 

This invention is particularly useful for screening compounds by using PRO polypeptides or binding 
fragments thereof in any of a variety of drug screening techniques. The PRO polypeptide or fragment employed 
in such a test may either be free in solution, affixed to a solid support, borne on a cell surface, or located 
intracellularly. One method of drug screening utilizes eukaryotic or prokaryotic host cells which are stably 
transformed with recombinant nucleic acids expressing the PRO polypeptide or fragment. Drugs are screened 
against such transformed cells in competitive binding assays. Such cells, either in viable or fixed form, can be 
used for standard binding assays. One may measure, for example, the formation of complexes between PRO 
polypeptide or a fragment and the agent being tested. Alternatively, one can examine the diminution in complex 
formation between the PRO polypeptide and its target cell or target receptors caused by the agent being tested. 

Thus, the present invention provides methods of screening for drugs or any other agents which can 
affect a PRO polypeptide-associated disease or disorder. These methods comprise contacting such an agent with 
a PRO polypeptide or fragment thereof and assaying (i) for the presence of a complex between the agent and 
the PRO polypeptide or fragment, or (ii) for the presence of a complex between the PRO polypeptide or fragment 
and the cell, by methods well known in the art. In such competitive binding assays, the PRO polypeptide or 
fragment is typically labeled. After suitable incubation, free PRO polypeptide or fragment is separated from that 
present in bound form, and the amount of free or uncomplexed label is a measure of the ability of the particular 
agent to bind to PRO polypeptide or to interfere with the PRO polypeptide/cell complex. 
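
As a rough illustration of how the readout of such a competitive binding assay might be reduced to a single number, the sketch below computes the percent reduction in specifically bound labeled PRO polypeptide caused by a test agent. The function name, the background-subtraction convention, and the count values are assumptions made for illustration; none of them comes from this specification.

```python
def percent_inhibition(bound_with_agent, bound_no_agent, background=0.0):
    """Percent reduction in specifically bound label caused by a test agent.

    bound_with_agent: bound label measured in the presence of the agent.
    bound_no_agent:   bound label measured without the agent (control).
    background:       nonspecific signal subtracted from both (assumed).
    """
    specific_control = bound_no_agent - background
    if specific_control <= 0:
        raise ValueError("control signal must exceed background")
    # Clamp at zero so readings below background count as full inhibition.
    specific_test = max(bound_with_agent - background, 0.0)
    return 100.0 * (1.0 - specific_test / specific_control)


# Hypothetical counts per minute, for illustration only.
control_cpm = 12000.0
background_cpm = 2000.0
agent_cpm = 4500.0
result = percent_inhibition(agent_cpm, control_cpm, background_cpm)
print(round(result, 1))  # 75.0
```

An agent that leaves the bound signal unchanged scores 0, and one that abolishes specific binding scores 100, so agents can be ranked on a common scale.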

Another technique for drug screening provides high throughput screening for compounds having suitable 
binding affinity to a polypeptide and is described in detail in WO 84/03564, published on September 13, 1984. 
Briefly stated, large numbers of different small peptide test compounds are synthesized on a solid substrate, such 
as plastic pins or some other surface. As applied to a PRO polypeptide, the peptide test compounds are reacted 
with PRO polypeptide and washed. Bound PRO polypeptide is detected by methods well known in the art. 
Purified PRO polypeptide can also be coated directly onto plates for use in the aforementioned drug screening 
techniques. In addition, non-neutralizing antibodies can be used to capture the peptide and immobilize it on the 
solid support. 

This invention also contemplates the use of competitive drug screening assays in which neutralizing 
antibodies capable of binding PRO polypeptide specifically compete with a test compound for binding to PRO 
polypeptide or fragments thereof. In this manner, the antibodies can be used to detect the presence of any 
peptide which shares one or more antigenic determinants with PRO polypeptide. 






EXAMPLE 147: Rational Drug Design 

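The goal of rational drug design is to produce structural analogs of biologically active polypeptide of 
interest (i.e., a PRO polypeptide) or of small molecules with which they interact, e.g., agonists, antagonists, or 
inhibitors. Any of these examples can be used to fashion drugs which are more active or stable forms of the 
PRO polypeptide or which enhance or interfere with the function of the PRO polypeptide in vivo (c.f., Hodgson, 
Bio/Technology, 9:19-21 (1991)). 

In one approach, the three-dimensional structure of the PRO polypeptide, or of a PRO 
polypeptide-inhibitor complex, is determined by x-ray crystallography, by computer modeling or, most typically, 
by a combination of the two approaches. Both the shape and charges of the PRO polypeptide must be ascertained 
to elucidate the structure and to determine active site(s) of the molecule. Less often, useful information regarding 
the structure of the PRO polypeptide may be gained by modeling based on the structure of homologous proteins. 
In both cases, relevant structural information is used to design analogous PRO polypeptide-like molecules or to 
identify efficient inhibitors. Useful examples of rational drug design may include molecules which have improved 
activity or stability as shown by Braxton and Wells, Biochemistry, 31:7796-7801 (1992) or which act as 
inhibitors, agonists, or antagonists of native peptides as shown by Athauda et al., J. Biochem., 113:742-746 
(1993). 

It is also possible to isolate a target-specific antibody, selected by functional assay, as described above, 
and then to solve its crystal structure. This approach, in principle, yields a pharmacore upon which subsequent 
drug design can be based. It is possible to bypass protein crystallography altogether by generating anti-idiotypic 
antibodies (anti-ids) to a functional, pharmacologically active antibody. As a mirror image of a mirror image, the 
binding site of the anti-ids would be expected to be an analog of the original receptor. The anti-id could then 
be used to identify and isolate peptides from banks of chemically or biologically produced peptides. The isolated 
peptides would then act as the pharmacore. 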

By virtue of the present invention, sufficient amounts of the PRO polypeptide may be made available 
to perform such analytical studies as X-ray crystallography. In addition, knowledge of the PRO polypeptide 
amino acid sequence provided herein will provide guidance to those employing computer modeling techniques 
in place of or in addition to x-ray crystallography. 

Deposit of Material 

The following materials have been deposited with the American Type Culture Collection, 10801 
University Blvd., Manassas, VA 20110-2209, USA (ATCC): 



Table 2 

Material           ATCC Dep. No.    Deposit Date 
DNA16422-1209      209929           June 2, 1998 
DNA16435-1208      209930           June 2, 1998 
DNA21624-1391      209917           June 2, 1998 
DNA23334-1392      209918           June 2, 1998 
DNA26288-1239      209792           April 21, 1998 
DNA26843-1389      203099           August 4, 1998 
DNA26844-1394      209926           June 2, 1998 
DNA30862-1396      209920           June 2, 1998 
DNA35680-1212      209790           April 21, 1998 
DNA40621-1440      209922           June 2, 1998 
DNA44161-1434      209907           May 27, 1998 
DNA44694-1500      203114           August 11, 1998 
DNA45495-1550      203156           August 25, 1998 
DNA47361-1154      209431           November 7, 1997 
DNA47394-1572      203109           August 11, 1998 
DNA48320-1433      209904           May 27, 1998 
DNA48334-1435      209924           June 2, 1998 
DNA48606-1479      203040           July 1, 1998 
DNA49141-1431      203003           June 23, 1998 
DNA49142-1430      203002           June 23, 1998 
DNA49143-1429      203013           June 23, 1998 
DNA49647-1398      209919           June 2, 1998 
DNA49819-1439      209931           June 2, 1998 
DNA49820-1427      209932           June 2, 1998 
DNA49821-1562      209981           June 16, 1998 
DNA52192-1369      203042           July 1, 1998 
DNA52598-1518      203107           August 11, 1998 
DNA53913-1490      203162           August 25, 1998 
DNA53978-1443      209983           June 16, 1998 
DNA53996-1442      209921           June 2, 1998 
DNA56041-1416      203012           June 23, 1998 
DNA56047-1456      209948           June 9, 1998 
DNA56050-1455      203011           June 23, 1998 
DNA56110-1437      203113           August 11, 1998 
DNA56113-1378      203049           July 1, 1998 
DNA56410-1414      209923           June 2, 1998 
DNA56436-1448      209902           May 27, 1998 
DNA56855-1447      203004           June 23, 1998 
DNA56859-1445      203019           June 23, 1998 
DNA56860-1510      209952           June 9, 1998 
DNA56865-1491      203022           June 23, 1998 
DNA56866-1342      203023           June 23, 1998 
DNA56868-1209      203024           June 23, 1998 
DNA56869-1545      203161           August 25, 1998 
DNA56870-1492      209925           June 2, 1998 
DNA57033-1403      209905           May 27, 1998 
DNA57037-1444      209903           May 27, 1998 
DNA57129-1413      209977           June 16, 1998 
DNA57690-1374      209950           June 9, 1998 
DNA57693-1424      203008           June 23, 1998 
DNA57694-1341      203017           June 23, 1998 
DNA57695-1340      203006           June 23, 1998 
DNA57699-1412      203020           June 23, 1998 
DNA57702-1476      209951           June 9, 1998 
DNA57704-1452      209953           June 9, 1998 
DNA57708-1411      203021           June 23, 1998 
DNA57710-1451      203048           July 1, 1998 
DNA57711-1501      203047           July 1, 1998 
DNA57827-1493      203045           July 1, 1998 
DNA57834-1339      209954           June 9, 1998 
DNA57836-1338      203025           June 23, 1998 
DNA57838-1337      203014           June 23, 1998 
DNA57844-1410      203010           June 23, 1998 
DNA58721-1475      203110           August 11, 1998 
DNA58723-1588      203133           August 18, 1998 
DNA58737-1473      203136           August 18, 1998 
DNA58743-1609      203154           August 25, 1998 
DNA58846-1409      209957           June 9, 1998 
DNA58848-1472      209955           June 9, 1998 
DNA58849-1494      209958           June 9, 1998 
DNA58850-1495      209956           June 9, 1998 
DNA58853-1423      203016           June 23, 1998 
DNA58855-1422      203018           June 23, 1998 
DNA59205-1421      203009           June 23, 1998 
DNA59211-1450      209960           June 9, 1998 
DNA59213-1487      209959           June 9, 1998 
DNA59214-1449      203046           July 1, 1998 
DNA59215-1425      209961           June 9, 1998 
DNA59220-1514      209962           June 9, 1998 
DNA59488-1603      203157           August 25, 1998 
DNA59493-1420      203050           July 1, 1998 
DNA59497-1496      209941           June 4, 1998 
DNA59588-1571      203106           August 11, 1998 
DNA59603-1419      209944           June 9, 1998 
DNA59605-1418      203005           June 23, 1998 
DNA59606-1471      209945           June 9, 1998 
DNA59607-1497      209957           June 9, 1998 
DNA59609-1470      209963           June 9, 1998 
DNA59610-1559      209990           June 16, 1998 
DNA59612-1466      209947           June 9, 1998 
DNA59613-1417      203007           June 23, 1998 
DNA59616-1465      209991           June 16, 1998 
DNA59619-1464      203041           July 1, 1998 
DNA59620-1463      209989           June 16, 1998 
DNA59625-1498      209992           June 17, 1998 
DNA59767-1489      203108           August 11, 1998 
DNA59776-1600      203128           August 18, 1998 
DNA59777-1480      203111           August 11, 1998 
DNA59820-1549      203129           August 18, 1998 
DNA59827-1426      203089           August 4, 1998 
DNA59828-1608      203158           August 25, 1998 
DNA59838-1462      209976           June 16, 1998 
DNA59839-1461      209988           June 16, 1998 
DNA59841-1460      203044           July 1, 1998 
DNA59842-1502      209982           June 16, 1998 
DNA59846-1503      209978           June 16, 1998 
DNA59847-1511      203098           August 4, 1998 
DNA59848-1512      203088           August 4, 1998 
DNA59849-1504      209986           June 16, 1998 
DNA59853-1505      209985           June 16, 1998 
DNA59854-1459      209974           June 16, 1998 
DNA60283-1484      203043           July 1, 1998 
DNA60615-1483      209980           June 16, 1998 
DNA60619-1482      209993           June 16, 1998 
DNA60621-1516      203091           August 4, 1998 
DNA60622-1525      203090           August 4, 1998 
DNA60625-1507      209975           June 16, 1998 
DNA60627-1508      203092           August 4, 1998 
DNA60629-1481      209979           June 16, 1998 
DNA61755-1554      203112           August 11, 1998 
DNA61873-1574      203132           August 18, 1998 
DNA62814-1521      203093           August 4, 1998 
DNA62872-1509      203100           August 4, 1998 
DNA62876-1517      203095           August 4, 1998 
DNA62881-1515      203096           August 4, 1998 
DNA64852-1589      203127           August 18, 1998 
DNA64884-1527      203155           August 25, 1998 
DNA64890-1612      203131           August 18, 1998 
DNA65412-1523      203094           August 4, 1998 
DNA66308-1537      203159           August 25, 1998 
DNA66309-1538      203235           September 15, 1998 
DNA67004-1614      203115           August 11, 1998 
DNA68869-1610      203164           August 25, 1998 
DNA68872-1620      203160           August 25, 1998 
DNA71159-1617      203135           August 18, 1998 



These deposits were made under the provisions of the Budapest Treaty on the International Recognition 
of the Deposit of Microorganisms for the Purpose of Patent Procedure and the Regulations thereunder (Budapest 
Treaty). This assures maintenance of a viable culture of the deposit for 30 years from the date of deposit. The 
deposits will be made available by ATCC under the terms of the Budapest Treaty, and subject to an agreement 
between Genentech, Inc. and ATCC, which assures permanent and unrestricted availability of the progeny of 
the culture of the deposit to the public upon issuance of the pertinent U.S. patent or upon laying open to the 
public of any U.S. or foreign patent application, whichever comes first, and assures availability of the progeny 
to one determined by the U.S. Commissioner of Patents and Trademarks to be entitled thereto according to 35 
USC §122 and the Commissioner's rules pursuant thereto (including 37 CFR §1.14 with particular reference to 
886 OG 638). 

The assignee of the present application has agreed that if a culture of the materials on deposit should 
die or be lost or destroyed when cultivated under suitable conditions, the materials will be promptly replaced on 
notification with another of the same. Availability of the deposited material is not to be construed as a license 
to practice the invention in contravention of the rights granted under the authority of any government in 
accordance with its patent laws. 

The foregoing written specification is considered to be sufficient to enable one skilled in the art to 
practice the invention. The present invention is not to be limited in scope by the construct deposited, since the 
deposited embodiment is intended as a single illustration of certain aspects of the invention and any constructs 
that are functionally equivalent are within the scope of this invention. The deposit of material herein does not 
constitute an admission that the written description herein contained is inadequate to enable the practice of any 
aspect of the invention, including the best mode thereof, nor is it to be construed as limiting the scope of the 
claims to the specific illustrations that it represents. Indeed, various modifications of the invention in addition 
to those shown and described herein will become apparent to those skilled in the art from the foregoing 
description and fall within the scope of the appended claims. 

WHAT IS CLAIMED IS: 

1. Isolated nucleic acid having at least 80% sequence identity to a nucleotide sequence that 
encodes a polypeptide comprising an amino acid sequence selected from the group consisting of the amino acid 
sequence shown in Figure 2 (SEQ ID NO:2), Figure 4 (SEQ ID NO:6), Figure 6 (SEQ ID NO:8), 
Figure 9 (SEQ ID NO:14), Figure 12 (SEQ ID NO:20), Figure 15 (SEQ ID NO:23), Figure 18 (SEQ ID NO:28), 
Figure 20 (SEQ ID NO:30), Figure 23 (SEQ ID NO:33), Figure 25 (SEQ ID NO:36), Figure 27 (SEQ ID NO:41), 
Figure 30 (SEQ ID NO:47), Figure 32 (SEQ ID NO:52), Figure 34 (SEQ ID NO:57), Figure 36 (SEQ ID NO:62), 
Figure 38 (SEQ ID NO:67), Figure 41 (SEQ ID NO:73), Figure 47 (SEQ ID NO:84), Figure 49 (SEQ ID NO:95), 
Figure 51 (SEQ ID NO:97), Figure 53 (SEQ ID NO:99), Figure 57 (SEQ ID NO:103), Figure 64 (SEQ ID NO:113), 
Figure 66 (SEQ ID NO:115), Figure 68 (SEQ ID NO:117), Figure 70 (SEQ ID NO:119), Figure 72 (SEQ ID NO:124), 
Figure 74 (SEQ ID NO:129), Figure 76 (SEQ ID NO:135), Figure 79 (SEQ ID NO:138), Figure 83 (SEQ ID NO:146), 
Figure 85 (SEQ ID NO:148), Figure 88 (SEQ ID NO:151), Figure 90 (SEQ ID NO:153), Figure 93 (SEQ ID NO:156), 
Figure 95 (SEQ ID NO:158), Figure 97 (SEQ ID NO:160), Figure 99 (SEQ ID NO:165), Figure 101 (SEQ ID NO:167), 
Figure 103 (SEQ ID NO:169), Figure 105 (SEQ ID NO:171), Figure 109 (SEQ ID NO:175), Figure 111 (SEQ ID NO:177), 
Figure 113 (SEQ ID NO:179), Figure 115 (SEQ ID NO:181), Figure 117 (SEQ ID NO:183), Figure 120 (SEQ ID NO:189), 
Figure 122 (SEQ ID NO:194), Figure 125 (SEQ ID NO:197), Figure 127 (SEQ ID NO:199), Figure 129 (SEQ ID NO:201), 
Figure 131 (SEQ ID NO:203), Figure 133 (SEQ ID NO:205), Figure 135 (SEQ ID NO:207), Figure 137 (SEQ ID NO:209), 
Figure 139 (SEQ ID NO:211), Figure 141 (SEQ ID NO:213), Figure 144 (SEQ ID NO:216), Figure 147 (SEQ ID NO:219), 
Figure 149 (SEQ ID NO:221), Figure 151 (SEQ ID NO:223), Figure 153 (SEQ ID NO:225), Figure 155 (SEQ ID NO:227), 
Figure 157 (SEQ ID NO:229), Figure 159 (SEQ ID NO:231), Figure 161 (SEQ ID NO:236), Figure 163 (SEQ ID NO:241), 
Figure 165 (SEQ ID NO:246), Figure 167 (SEQ ID NO:248), Figure 169 (SEQ ID NO:250), Figure 171 (SEQ ID NO:253), 
Figure 174 (SEQ ID NO:256), Figure 176 (SEQ ID NO:258), Figure 178 (SEQ ID NO:260), Figure 180 (SEQ ID NO:262), 
Figure 182 (SEQ ID NO:264), Figure 184 (SEQ ID NO:266), Figure 186 (SEQ ID NO:268), Figure 188 (SEQ ID NO:270), 
Figure 190 (SEQ ID NO:272), Figure 192 (SEQ ID NO:274), Figure 194 (SEQ ID NO:276), Figure 196 (SEQ ID NO:278), 
Figure 198 (SEQ ID NO:281), Figure 200 (SEQ ID NO:283), Figure 202 (SEQ ID NO:285), Figure 204 (SEQ ID NO:287), 
Figure 206 (SEQ ID NO:289), Figure 208 (SEQ ID NO:291), Figure 210 (SEQ ID NO:293), Figure 212 (SEQ ID NO:295), 
Figure 214 (SEQ ID NO:297), Figure 216 (SEQ ID NO:299), Figure 218 (SEQ ID NO:301), Figure 220 (SEQ ID NO:303), 
Figure 226 (SEQ ID NO:309), Figure 228 (SEQ ID NO:314), Figure 230 (SEQ ID NO:319), Figure 233 (SEQ ID NO:326), 
Figure 235 (SEQ ID NO:334), Figure 238 (SEQ ID NO:340), Figure 240 (SEQ ID NO:345), Figure 242 (SEQ ID NO:347), 
Figure 244 (SEQ ID NO:349), Figure 246 (SEQ ID NO:351), Figure 248 (SEQ ID NO:353), Figure 250 (SEQ ID NO:355), 
Figure 252 (SEQ ID NO:357), Figure 254 (SEQ ID NO:359), Figure 256 (SEQ ID NO:361), Figure 258 (SEQ ID NO:363), 
Figure 260 (SEQ ID NO:365), Figure 262 (SEQ ID NO:367), Figure 264 (SEQ ID NO:369), Figure 266 (SEQ ID NO:371), 
Figure 268 (SEQ ID NO:373), Figure 270 (SEQ ID NO:375), Figure 272 (SEQ ID NO:377), Figure 274 (SEQ ID NO:379), 
Figure 276 (SEQ ID NO:381), Figure 278 (SEQ ID NO:387), Figure 280 (SEQ ID NO:389), Figure 282 (SEQ ID NO:394), 
Figure 284 (SEQ ID NO:399), Figure 286 (SEQ ID NO:401), Figure 288 (SEQ ID NO:403), Figure 290 (SEQ ID NO:408), 
Figure 292 (SEQ ID NO:410), Figure 294 (SEQ ID NO:412), Figure 296 (SEQ ID NO:414), Figure 298 (SEQ ID NO:416), 
Figure 300 (SEQ ID NO:418), Figure 302 (SEQ ID NO:420), Figure 304 (SEQ ID NO:422) and Figure 306 (SEQ ID NO:424). 

2. The nucleic acid sequence of Claim 1, wherein said nucleotide sequence comprises a nucleotide 
sequence selected from the group consisting of the sequence shown in Figure 1 (SEQ ID NO:1), 
Figure 3 (SEQ ID NO:5), Figure 5 (SEQ ID NO:7), Figure 8 (SEQ ID NO:13), Figure 11 (SEQ ID NO:19), 
Figure 14 (SEQ ID NO:22), Figure 17 (SEQ ID NO:27), Figure 19 (SEQ ID NO:29), Figure 22 (SEQ ID NO:32), 
Figure 24 (SEQ ID NO:35), Figure 26 (SEQ ID NO:40), Figure 29 (SEQ ID NO:46), Figure 31 (SEQ ID NO:51), 
Figure 33 (SEQ ID NO:56), Figure 35 (SEQ ID NO:61), Figure 37 (SEQ ID NO:66), Figure 40 (SEQ ID NO:72), 
Figure 46 (SEQ ID NO:83), Figure 48 (SEQ ID NO:94), Figure 50 (SEQ ID NO:96), Figure 52 (SEQ ID NO:98), 
Figure 56 (SEQ ID NO:102), Figure 63 (SEQ ID NO:112), Figure 65 (SEQ ID NO:114), Figure 67 (SEQ ID NO:116), 
Figure 69 (SEQ ID NO:118), Figure 71 (SEQ ID NO:123), Figure 73 (SEQ ID NO:128), Figure 75 (SEQ ID NO:134), 
Figure 78 (SEQ ID NO:137), Figure 82 (SEQ ID NO:145), Figure 84 (SEQ ID NO:147), Figure 87 (SEQ ID NO:150), 
Figure 89 (SEQ ID NO:152), Figure 92 (SEQ ID NO:155), Figure 94 (SEQ ID NO:157), Figure 96 (SEQ ID NO:159), 
Figure 98 (SEQ ID NO:164), Figure 100 (SEQ ID NO:166), Figure 102 (SEQ ID NO:168), Figure 104 (SEQ ID NO:170), 
Figure 108 (SEQ ID NO:174), Figure 110 (SEQ ID NO:176), Figure 112 (SEQ ID NO:178), Figure 114 (SEQ ID NO:180), 
Figure 116 (SEQ ID NO:182), Figure 119 (SEQ ID NO:188), Figure 121 (SEQ ID NO:193), Figure 124 (SEQ ID NO:196), 
Figure 126 (SEQ ID NO:198), Figure 128 (SEQ ID NO:200), Figure 130 (SEQ ID NO:202), Figure 132 (SEQ ID NO:204), 
Figure 134 (SEQ ID NO:206), Figure 136 (SEQ ID NO:208), Figure 138 (SEQ ID NO:210), Figure 140 (SEQ ID NO:212), 
Figure 143 (SEQ ID NO:215), Figure 146 (SEQ ID NO:218), Figure 148 (SEQ ID NO:220), Figure 150 (SEQ ID NO:222), 
Figure 152 (SEQ ID NO:224), Figure 154 (SEQ ID NO:226), Figure 156 (SEQ ID NO:228), Figure 158 (SEQ ID NO:230), 
Figure 160 (SEQ ID NO:235), Figure 162 (SEQ ID NO:240), Figure 164 (SEQ ID NO:245), Figure 166 (SEQ ID NO:247), 
Figure 168 (SEQ ID NO:249), Figure 170 (SEQ ID NO:252), Figure 173 (SEQ ID NO:255), Figure 175 (SEQ ID NO:257), 
Figure 177 (SEQ ID NO:259), Figure 179 (SEQ ID NO:261), Figure 181 (SEQ ID NO:263), Figure 183 (SEQ ID NO:265), 
Figure 185 (SEQ ID NO:267), Figure 187 (SEQ ID NO:269), Figure 189 (SEQ ID NO:271), Figure 191 (SEQ ID NO:273), 
Figure 193 (SEQ ID NO:275), Figure 195 (SEQ ID NO:277), Figure 197 (SEQ ID NO:280), Figure 199 (SEQ ID NO:282), 
Figure 201 (SEQ ID NO:284), Figure 203 (SEQ ID NO:286), Figure 205 (SEQ ID NO:288), Figure 207 (SEQ ID NO:290), 
Figure 209 (SEQ ID NO:292), Figure 211 (SEQ ID NO:294), Figure 213 (SEQ ID NO:296), Figure 215 (SEQ ID NO:298), 
Figure 217 (SEQ ID NO:300), Figure 219 (SEQ ID NO:302), Figure 225 (SEQ ID NO:308), Figure 227 (SEQ ID NO:313), 
Figure 229 (SEQ ID NO:318), Figure 232 (SEQ ID NO:325), Figure 234 (SEQ ID NO:333), Figure 237 (SEQ ID NO:339), 
Figure 239 (SEQ ID NO:344), Figure 241 (SEQ ID NO:346), Figure 243 (SEQ ID NO:348), Figure 245 (SEQ ID NO:350), 
Figure 247 (SEQ ID NO:352), Figure 249 (SEQ ID NO:354), Figure 251 (SEQ ID NO:356), Figure 253 (SEQ ID NO:358), 
Figure 255 (SEQ ID NO:360), Figure 257 (SEQ ID NO:362), Figure 259 (SEQ ID NO:364), Figure 261 (SEQ ID NO:366), 
Figure 263 (SEQ ID NO:368), Figure 265 (SEQ ID NO:370), Figure 267 (SEQ ID NO:372), Figure 269 (SEQ ID NO:374), 
Figure 271 (SEQ ID NO:376), Figure 273 (SEQ ID NO:378), Figure 275 (SEQ ID NO:380), Figure 277 (SEQ ID NO:386), 
Figure 279 (SEQ ID NO:388), Figure 281 (SEQ ID NO:393), Figure 283 (SEQ ID NO:398), Figure 285 (SEQ ID NO:400), 
Figure 287 (SEQ ID NO:402), Figure 289 (SEQ ID NO:407), Figure 291 (SEQ ID NO:409), Figure 293 (SEQ ID NO:411), 
Figure 295 (SEQ ID NO:413), Figure 297 (SEQ ID NO:415), Figure 299 (SEQ ID NO:417), Figure 301 (SEQ ID NO:419), 
Figure 303 (SEQ ID NO:421) and Figure 305 (SEQ ID NO:423). 

3. The nucleic acid of Claim 1, wherein said nucleotide sequence comprises a nucleotide sequence 
selected from the group consisting of the full-length coding sequence of the sequence shown in Figure 1 (SEQ ID NO:1), 
Figure 3 (SEQ ID NO:5), Figure 5 (SEQ ID NO:7), Figure 8 (SEQ ID NO:13), Figure 11 (SEQ ID NO:19), 
Figure 14 (SEQ ID NO:22), Figure 17 (SEQ ID NO:27), Figure 19 (SEQ ID NO:29), Figure 22 (SEQ ID NO:32), 
Figure 24 (SEQ ID NO:35), Figure 26 (SEQ ID NO:40), Figure 29 (SEQ ID NO:46), Figure 31 (SEQ ID NO:51), 
Figure 33 (SEQ ID NO:56), Figure 35 (SEQ ID NO:61), Figure 37 (SEQ ID NO:66), Figure 40 (SEQ ID NO:72), 
Figure 46 (SEQ ID NO:83), Figure 48 (SEQ ID NO:94), Figure 50 (SEQ ID NO:96), Figure 52 (SEQ ID NO:98), 
Figure 56 (SEQ ID NO:102), Figure 63 (SEQ ID NO:112), Figure 65 (SEQ ID NO:114), Figure 67 (SEQ ID NO:116), 
Figure 69 (SEQ ID NO:118), Figure 71 (SEQ ID NO:123), Figure 73 (SEQ ID NO:128), Figure 75 (SEQ ID NO:134), 
Figure 78 (SEQ ID NO:137), Figure 82 (SEQ ID NO:145), Figure 84 (SEQ ID NO:147), Figure 87 (SEQ ID NO:150), 
Figure 89 (SEQ ID NO:152), Figure 92 (SEQ ID NO:155), Figure 94 (SEQ ID NO:157), Figure 96 (SEQ ID NO:159), 
Figure 98 (SEQ ID NO:164), Figure 100 (SEQ ID NO:166), Figure 102 (SEQ ID NO:168), Figure 104 (SEQ ID NO:170), 
Figure 108 (SEQ ID NO:174), Figure 110 (SEQ ID NO:176), Figure 112 (SEQ ID NO:178), Figure 114 (SEQ ID NO:180), 
Figure 116 (SEQ ID NO:182), Figure 119 (SEQ ID NO:188), Figure 121 (SEQ ID NO:193), Figure 124 (SEQ ID NO:196), 
Figure 126 (SEQ ID NO:198), Figure 128 (SEQ ID NO:200), Figure 130 (SEQ ID NO:202), Figure 132 (SEQ ID NO:204), 
Figure 134 (SEQ ID NO:206), Figure 136 (SEQ ID NO:208), Figure 138 (SEQ ID NO:210), Figure 140 (SEQ ID NO:212), 
Figure 143 (SEQ ID NO:215), Figure 146 (SEQ ID NO:218), Figure 148 (SEQ ID NO:220), Figure 150 (SEQ ID NO:222), 
Figure 152 (SEQ ID NO:224), Figure 154 (SEQ ID NO:226), Figure 156 (SEQ ID NO:228), Figure 158 (SEQ ID NO:230), 
Figure 160 (SEQ ID NO:235), Figure 162 (SEQ ID NO:240), Figure 164 (SEQ ID NO:245), Figure 166 (SEQ ID NO:247), 
Figure 168 (SEQ ID NO:249), Figure 170 (SEQ ID NO:252), Figure 173 (SEQ ID NO:255), Figure 175 (SEQ ID NO:257), 
Figure 177 (SEQ ID NO:259), Figure 179 (SEQ ID NO:261), Figure 181 (SEQ ID NO:263), Figure 183 (SEQ ID NO:265), 
Figure 185 (SEQ ID NO:267), Figure 187 (SEQ ID NO:269), Figure 189 (SEQ ID NO:271), Figure 191 (SEQ ID NO:273), 
Figure 193 (SEQ ID NO:275), Figure 195 (SEQ ID NO:277), Figure 197 (SEQ ID NO:280), Figure 199 (SEQ ID NO:282), 
Figure 201 (SEQ ID NO:284), Figure 203 (SEQ ID NO:286), Figure 205 (SEQ ID NO:288), Figure 207 (SEQ ID NO:290), 
Figure 209 (SEQ ID NO:292), Figure 211 (SEQ ID NO:294), Figure 213 (SEQ ID NO:296), Figure 215 (SEQ ID NO:298), 
Figure 217 (SEQ ID NO:300), Figure 219 (SEQ ID NO:302), Figure 225 (SEQ ID NO:308), Figure 227 (SEQ ID NO:313), 
Figure 229 (SEQ ID NO:318), Figure 232 (SEQ ID NO:325), Figure 234 (SEQ ID NO:333), Figure 237 (SEQ ID NO:339), 
Figure 239 (SEQ ID NO:344), Figure 241 (SEQ ID NO:346), Figure 243 (SEQ ID NO:348), Figure 245 (SEQ ID NO:350), 
Figure 247 (SEQ ID NO:352), Figure 249 (SEQ ID NO:354), Figure 251 (SEQ ID NO:356), Figure 253 (SEQ ID NO:358), 
Figure 255 (SEQ ID NO:360), Figure 257 (SEQ ID NO:362), Figure 259 (SEQ ID NO:364), Figure 261 (SEQ ID NO:366), 
Figure 263 (SEQ ID NO:368), Figure 265 (SEQ ID NO:370), Figure 267 (SEQ ID NO:372), Figure 269 (SEQ ID NO:374), 
Figure 271 (SEQ ID NO:376), Figure 273 (SEQ ID NO:378), Figure 275 (SEQ ID NO:380), Figure 277 (SEQ ID NO:386), 
Figure 279 (SEQ ID NO:388), Figure 281 (SEQ ID NO:393), Figure 283 (SEQ ID NO:398), Figure 285 (SEQ ID NO:400), 
Figure 287 (SEQ ID NO:402), Figure 289 (SEQ ID NO:407), Figure 291 (SEQ ID NO:409), Figure 293 (SEQ ID NO:411), 
Figure 295 (SEQ ID NO:413), Figure 297 (SEQ ID NO:415), Figure 299 (SEQ ID NO:417), Figure 301 (SEQ ID NO:419), 
Figure 303 (SEQ ID NO:421) or Figure 305 (SEQ ID NO:423). 

4. Isolated nucleic acid which comprises the full-length coding sequence of the DNA deposited 
under any ATCC accession number shown in Table 2. 

5. A vector comprising the nucleic acid of Claim 1. 

6. The vector of Claim 5 operably linked to control sequences recognized by a host cell 
transformed with the vector. 

7. A host cell comprising the vector of Claim 5. 

8. The host cell of Claim 7 wherein said cell is a CHO cell. 

9. The host cell of Claim 7 wherein said cell is an E. coli. 

10. The host cell of Claim 7 wherein said cell is a yeast cell. 

11. A process for producing a PRO polypeptide comprising culturing the host cell of Claim 7 
under conditions suitable for expression of said PRO polypeptide and recovering said PRO polypeptide from the 
cell culture. 

12. Isolated PRO polypeptide having at least 80% sequence identity to an amino acid sequence 
selected from the group consisting of the amino acid sequence shown in Figure 2 (SEQ ID NO:2), 
Figure 4 (SEQ ID NO:6), Figure 6 (SEQ ID NO:8), Figure 9 (SEQ ID NO:14), Figure 12 (SEQ ID NO:20), 
Figure 15 (SEQ ID NO:23), Figure 18 (SEQ ID NO:28), Figure 20 (SEQ ID NO:30), Figure 23 (SEQ ID NO:33), 
Figure 25 (SEQ ID NO:36), Figure 27 (SEQ ID NO:41), Figure 30 (SEQ ID NO:47), Figure 32 (SEQ ID NO:52), 
Figure 34 (SEQ ID NO:57), Figure 36 (SEQ ID NO:62), Figure 38 (SEQ ID NO:67), Figure 41 (SEQ ID NO:73), 
Figure 47 (SEQ ID NO:84), Figure 49 (SEQ ID NO:95), Figure 51 (SEQ ID NO:97), Figure 53 (SEQ ID NO:99), 
Figure 57 (SEQ ID NO:103), Figure 64 (SEQ ID NO:113), Figure 66 (SEQ ID NO:115), Figure 68 



WO 99/63088                                                                PCT/US99/12252 



(SEQ ID NO:117), Figure 70 (SEQ ID NO:119), Figure 72 (SEQ ID NO:124), Figure 74 (SEQ ID NO:129), 
Figure 76 (SEQ ID NO:135), Figure 79 (SEQ ID NO:138), Figure 83 (SEQ ID NO:146), Figure 85 (SEQ ID NO:148), 
Figure 88 (SEQ ID NO:151), Figure 90 (SEQ ID NO:153), Figure 93 (SEQ ID NO:156), Figure 95 (SEQ ID NO:158), 
Figure 97 (SEQ ID NO:160), Figure 99 (SEQ ID NO:165), Figure 101 (SEQ ID NO:167), Figure 103 (SEQ ID NO:169), 
Figure 105 (SEQ ID NO:171), Figure 109 (SEQ ID NO:175), Figure 111 (SEQ ID NO:177), Figure 113 (SEQ ID NO:179), 
Figure 115 (SEQ ID NO:181), Figure 117 (SEQ ID NO:183), Figure 120 (SEQ ID NO:189), Figure 122 (SEQ ID NO:194), 
Figure 125 (SEQ ID NO:197), Figure 127 (SEQ ID NO:199), Figure 129 (SEQ ID NO:201), Figure 131 (SEQ ID NO:203), 
Figure 133 (SEQ ID NO:205), Figure 135 (SEQ ID NO:207), Figure 137 (SEQ ID NO:209), Figure 139 (SEQ ID NO:211), 
Figure 141 (SEQ ID NO:213), Figure 144 (SEQ ID NO:216), Figure 147 (SEQ ID NO:219), Figure 149 (SEQ ID NO:221), 
Figure 151 (SEQ ID NO:223), Figure 153 (SEQ ID NO:225), Figure 155 (SEQ ID NO:227), Figure 157 (SEQ ID NO:229), 
Figure 159 (SEQ ID NO:231), Figure 161 (SEQ ID NO:236), Figure 163 (SEQ ID NO:241), Figure 165 (SEQ ID NO:246), 
Figure 167 (SEQ ID NO:248), Figure 169 (SEQ ID NO:250), Figure 171 (SEQ ID NO:253), Figure 174 (SEQ ID NO:256), 
Figure 176 (SEQ ID NO:258), Figure 178 (SEQ ID NO:260), Figure 180 (SEQ ID NO:262), Figure 182 (SEQ ID NO:264), 
Figure 184 (SEQ ID NO:266), Figure 186 (SEQ ID NO:268), Figure 188 (SEQ ID NO:270), Figure 190 (SEQ ID NO:272), 
Figure 192 (SEQ ID NO:274), Figure 194 (SEQ ID NO:276), Figure 196 (SEQ ID NO:278), Figure 198 (SEQ ID NO:281), 
Figure 200 (SEQ ID NO:283), Figure 202 (SEQ ID NO:285), Figure 204 (SEQ ID NO:287), Figure 206 (SEQ ID NO:289), 
Figure 208 (SEQ ID NO:291), Figure 210 (SEQ ID NO:293), Figure 212 (SEQ ID NO:295), Figure 214 (SEQ ID NO:297), 
Figure 216 (SEQ ID NO:299), Figure 218 (SEQ ID NO:301), Figure 220 (SEQ ID NO:303), Figure 226 (SEQ ID NO:309), 
Figure 228 (SEQ ID NO:314), Figure 230 (SEQ ID NO:319), Figure 233 (SEQ ID NO:326), Figure 235 (SEQ ID NO:334), 
Figure 238 (SEQ ID NO:340), Figure 240 (SEQ ID NO:345), Figure 242 (SEQ ID NO:347), Figure 244 (SEQ ID NO:349), 
Figure 246 (SEQ ID NO:351), Figure 248 (SEQ ID NO:353), Figure 250 (SEQ ID NO:355), Figure 252 (SEQ ID NO:357), 
Figure 254 (SEQ ID NO:359), Figure 256 (SEQ ID NO:361), Figure 258 (SEQ ID NO:363), Figure 260 (SEQ ID NO:365), 
Figure 262 (SEQ ID NO:367), Figure 264 (SEQ ID NO:369), Figure 266 (SEQ ID NO:371), Figure 268 (SEQ ID NO:373), 
Figure 270 (SEQ ID NO:375), Figure 272 (SEQ ID NO:377), Figure 274 (SEQ ID NO:379), Figure 276 (SEQ ID NO:381), 
Figure 278 (SEQ ID NO:387), Figure 280 (SEQ ID NO:389), Figure 282 (SEQ ID NO:394), Figure 284 (SEQ ID NO:399), 
Figure 286 (SEQ ID NO:401), Figure 288 (SEQ ID NO:403), Figure 290 (SEQ ID NO:408), Figure 292 (SEQ ID NO:410), 
Figure 294 (SEQ ID NO:412), Figure 296 (SEQ ID NO:414), Figure 298 (SEQ ID NO:416), Figure 300 (SEQ ID NO:418), 
Figure 302 (SEQ ID NO:420), Figure 304 (SEQ ID NO:422) and Figure 306 (SEQ ID NO:424). 

13. Isolated PRO polypeptide having at least 80% sequence identity to the amino acid sequence 
encoded by a nucleic acid molecule deposited under any ATCC accession number shown in Table 2.

14. A chimeric molecule comprising a polypeptide according to Claim 12 fused to a heterologous 
amino acid sequence.

15. The chimeric molecule of Claim 14 wherein said heterologous amino acid sequence is an
epitope tag sequence. 

16. The chimeric molecule of Claim 14 wherein said heterologous amino acid sequence is a Fc 
region of an immunoglobulin. 

17. An antibody which specifically binds to a PRO polypeptide according to Claim 12. 

18. The antibody of Claim 17 wherein said antibody is a monoclonal antibody. 

19. The antibody of Claim 17 wherein said antibody is a humanized antibody. 

20. The antibody of Claim 17 wherein said antibody is an antibody fragment.

21. An isolated nucleic acid molecule which has at least 80% sequence identity to a nucleic acid
which comprises a nucleotide sequence selected from the group consisting of that shown in Figure 1 (SEQ ID
NO:1), Figure 3 (SEQ ID NO:5), Figure 5 (SEQ ID NO:7), Figure 8 (SEQ ID NO:13), Figure 11 (SEQ ID
NO:19), Figure 14 (SEQ ID NO:22), Figure 17 (SEQ ID NO:27), Figure 19 (SEQ ID NO:29), Figure 22 (SEQ
ID NO:32), Figure 24 (SEQ ID NO:35), Figure 26 (SEQ ID NO:40), Figure 29 (SEQ ID NO:46), Figure 31
(SEQ ID NO:51), Figure 33 (SEQ ID NO:56), Figure 35 (SEQ ID NO:61), Figure 37 (SEQ ID NO:66), Figure
40 (SEQ ID NO:72), Figure 46 (SEQ ID NO:83), Figure 48 (SEQ ID NO:94), Figure 50 (SEQ ID NO:96),
Figure 52 (SEQ ID NO:98), Figure 56 (SEQ ID NO:102), Figure 63 (SEQ ID NO:112), Figure 65 (SEQ ID
NO:114), Figure 67 (SEQ ID NO:116), Figure 69 (SEQ ID NO:118), Figure 71 (SEQ ID NO:123), Figure 73
(SEQ ID NO:128), Figure 75 (SEQ ID NO:134), Figure 78 (SEQ ID NO:137), Figure 82 (SEQ ID NO:145),
Figure 84 (SEQ ID NO:147), Figure 87 (SEQ ID NO:150), Figure 89 (SEQ ID NO:152), Figure 92 (SEQ ID
NO:155), Figure 94 (SEQ ID NO:157), Figure 96 (SEQ ID NO:159), Figure 98 (SEQ ID NO:164), Figure 100
(SEQ ID NO:166), Figure 102 (SEQ ID NO:168), Figure 104 (SEQ ID NO:170), Figure 108 (SEQ ID
NO:174), Figure 110 (SEQ ID NO:176), Figure 112 (SEQ ID NO:178), Figure 114 (SEQ ID NO:180), Figure
116 (SEQ ID NO:182), Figure 119 (SEQ ID NO:188), Figure 121 (SEQ ID NO:193), Figure 124 (SEQ ID
NO:196), Figure 126 (SEQ ID NO:198), Figure 128 (SEQ ID NO:200), Figure 130 (SEQ ID NO:202), Figure
132 (SEQ ID NO:204), Figure 134 (SEQ ID NO:206), Figure 136 (SEQ ID NO:208), Figure 138 (SEQ ID
NO:210), Figure 140 (SEQ ID NO:212), Figure 143 (SEQ ID NO:215), Figure 146 (SEQ ID NO:218), Figure
148 (SEQ ID NO:220), Figure 150 (SEQ ID NO:222), Figure 152 (SEQ ID NO:224), Figure 154 (SEQ ID
NO:226), Figure 156 (SEQ ID NO:228), Figure 158 (SEQ ID NO:230), Figure 160 (SEQ ID NO:235), Figure
162 (SEQ ID NO:240), Figure 164 (SEQ ID NO:245), Figure 166 (SEQ ID NO:247), Figure 168 (SEQ ID
NO:249), Figure 170 (SEQ ID NO:252), Figure 173 (SEQ ID NO:255), Figure 175 (SEQ ID NO:257), Figure
177 (SEQ ID NO:259), Figure 179 (SEQ ID NO:261), Figure 181 (SEQ ID NO:263), Figure 183 (SEQ ID
NO:265), Figure 185 (SEQ ID NO:267), Figure 187 (SEQ ID NO:269), Figure 189 (SEQ ID NO:271), Figure
191 (SEQ ID NO:273), Figure 193 (SEQ ID NO:275), Figure 195 (SEQ ID NO:277), Figure 197 (SEQ ID
NO:280), Figure 199 (SEQ ID NO:282), Figure 201 (SEQ ID NO:284), Figure 203 (SEQ ID NO:286), Figure
205 (SEQ ID NO:288), Figure 207 (SEQ ID NO:290), Figure 209 (SEQ ID NO:292), Figure 211 (SEQ ID
NO:294), Figure 213 (SEQ ID NO:296), Figure 215 (SEQ ID NO:298), Figure 217 (SEQ ID NO:300), Figure
219 (SEQ ID NO:302), Figure 225 (SEQ ID NO:308), Figure 227 (SEQ ID NO:313), Figure 229 (SEQ ID
NO:318), Figure 232 (SEQ ID NO:325), Figure 234 (SEQ ID NO:333), Figure 237 (SEQ ID NO:339), Figure
239 (SEQ ID NO:344), Figure 241 (SEQ ID NO:346), Figure 243 (SEQ ID NO:348), Figure 245 (SEQ ID
NO:350), Figure 247 (SEQ ID NO:352), Figure 249 (SEQ ID NO:354), Figure 251 (SEQ ID NO:356), Figure
253 (SEQ ID NO:358), Figure 255 (SEQ ID NO:360), Figure 257 (SEQ ID NO:362), Figure 259 (SEQ ID
NO:364), Figure 261 (SEQ ID NO:366), Figure 263 (SEQ ID NO:368), Figure 265 (SEQ ID NO:370), Figure
267 (SEQ ID NO:372), Figure 269 (SEQ ID NO:374), Figure 271 (SEQ ID NO:376), Figure 273 (SEQ ID
NO:378), Figure 275 (SEQ ID NO:380), Figure 277 (SEQ ID NO:386), Figure 279 (SEQ ID NO:388), Figure
281 (SEQ ID NO:393), Figure 283 (SEQ ID NO:398), Figure 285 (SEQ ID NO:400), Figure 287 (SEQ ID
NO:402), Figure 289 (SEQ ID NO:407), Figure 291 (SEQ ID NO:409), Figure 293 (SEQ ID NO:411), Figure
295 (SEQ ID NO:413), Figure 297 (SEQ ID NO:415), Figure 299 (SEQ ID NO:417), Figure 301 (SEQ ID
NO:419), Figure 303 (SEQ ID NO:421) and Figure 305 (SEQ ID NO:423).

22. An isolated nucleic acid molecule which has at least 80% sequence identity to the full-length
coding sequence of a nucleotide sequence selected from the group consisting of that shown in Figure 1 (SEQ ID
NO:1), Figure 3 (SEQ ID NO:5), Figure 5 (SEQ ID NO:7), Figure 8 (SEQ ID NO:13), Figure 11 (SEQ ID
NO:19), Figure 14 (SEQ ID NO:22), Figure 17 (SEQ ID NO:27), Figure 19 (SEQ ID NO:29), Figure 22 (SEQ
ID NO:32), Figure 24 (SEQ ID NO:35), Figure 26 (SEQ ID NO:40), Figure 29 (SEQ ID NO:46), Figure 31
(SEQ ID NO:51), Figure 33 (SEQ ID NO:56), Figure 35 (SEQ ID NO:61), Figure 37 (SEQ ID NO:66), Figure
40 (SEQ ID NO:72), Figure 46 (SEQ ID NO:83), Figure 48 (SEQ ID NO:94), Figure 50 (SEQ ID NO:96),
Figure 52 (SEQ ID NO:98), Figure 56 (SEQ ID NO:102), Figure 63 (SEQ ID NO:112), Figure 65 (SEQ ID
NO:114), Figure 67 (SEQ ID NO:116), Figure 69 (SEQ ID NO:118), Figure 71 (SEQ ID NO:123), Figure 73
(SEQ ID NO:128), Figure 75 (SEQ ID NO:134), Figure 78 (SEQ ID NO:137), Figure 82 (SEQ ID NO:145),
Figure 84 (SEQ ID NO:147), Figure 87 (SEQ ID NO:150), Figure 89 (SEQ ID NO:152), Figure 92 (SEQ ID
NO:155), Figure 94 (SEQ ID NO:157), Figure 96 (SEQ ID NO:159), Figure 98 (SEQ ID NO:164), Figure 100
(SEQ ID NO:166), Figure 102 (SEQ ID NO:168), Figure 104 (SEQ ID NO:170), Figure 108 (SEQ ID
NO:174), Figure 110 (SEQ ID NO:176), Figure 112 (SEQ ID NO:178), Figure 114 (SEQ ID NO:180), Figure
116 (SEQ ID NO:182), Figure 119 (SEQ ID NO:188), Figure 121 (SEQ ID NO:193), Figure 124 (SEQ ID
NO:196), Figure 126 (SEQ ID NO:198), Figure 128 (SEQ ID NO:200), Figure 130 (SEQ ID NO:202), Figure
132 (SEQ ID NO:204), Figure 134 (SEQ ID NO:206), Figure 136 (SEQ ID NO:208), Figure 138 (SEQ ID
NO:210), Figure 140 (SEQ ID NO:212), Figure 143 (SEQ ID NO:215), Figure 146 (SEQ ID NO:218), Figure
148 (SEQ ID NO:220), Figure 150 (SEQ ID NO:222), Figure 152 (SEQ ID NO:224), Figure 154 (SEQ ID
NO:226), Figure 156 (SEQ ID NO:228), Figure 158 (SEQ ID NO:230), Figure 160 (SEQ ID NO:235), Figure
162 (SEQ ID NO:240), Figure 164 (SEQ ID NO:245), Figure 166 (SEQ ID NO:247), Figure 168 (SEQ ID
NO:249), Figure 170 (SEQ ID NO:252), Figure 173 (SEQ ID NO:255), Figure 175 (SEQ ID NO:257), Figure
177 (SEQ ID NO:259), Figure 179 (SEQ ID NO:261), Figure 181 (SEQ ID NO:263), Figure 183 (SEQ ID
NO:265), Figure 185 (SEQ ID NO:267), Figure 187 (SEQ ID NO:269), Figure 189 (SEQ ID NO:271), Figure
191 (SEQ ID NO:273), Figure 193 (SEQ ID NO:275), Figure 195 (SEQ ID NO:277), Figure 197 (SEQ ID
NO:280), Figure 199 (SEQ ID NO:282), Figure 201 (SEQ ID NO:284), Figure 203 (SEQ ID NO:286), Figure
205 (SEQ ID NO:288), Figure 207 (SEQ ID NO:290), Figure 209 (SEQ ID NO:292), Figure 211 (SEQ ID
NO:294), Figure 213 (SEQ ID NO:296), Figure 215 (SEQ ID NO:298), Figure 217 (SEQ ID NO:300), Figure
219 (SEQ ID NO:302), Figure 225 (SEQ ID NO:308), Figure 227 (SEQ ID NO:313), Figure 229 (SEQ ID
NO:318), Figure 232 (SEQ ID NO:325), Figure 234 (SEQ ID NO:333), Figure 237 (SEQ ID NO:339), Figure
239 (SEQ ID NO:344), Figure 241 (SEQ ID NO:346), Figure 243 (SEQ ID NO:348), Figure 245 (SEQ ID
NO:350), Figure 247 (SEQ ID NO:352), Figure 249 (SEQ ID NO:354), Figure 251 (SEQ ID NO:356), Figure
253 (SEQ ID NO:358), Figure 255 (SEQ ID NO:360), Figure 257 (SEQ ID NO:362), Figure 259 (SEQ ID
NO:364), Figure 261 (SEQ ID NO:366), Figure 263 (SEQ ID NO:368), Figure 265 (SEQ ID NO:370), Figure
267 (SEQ ID NO:372), Figure 269 (SEQ ID NO:374), Figure 271 (SEQ ID NO:376), Figure 273 (SEQ ID
NO:378), Figure 275 (SEQ ID NO:380), Figure 277 (SEQ ID NO:386), Figure 279 (SEQ ID NO:388), Figure
281 (SEQ ID NO:393), Figure 283 (SEQ ID NO:398), Figure 285 (SEQ ID NO:400), Figure 287 (SEQ ID
NO:402), Figure 289 (SEQ ID NO:407), Figure 291 (SEQ ID NO:409), Figure 293 (SEQ ID NO:411), Figure
295 (SEQ ID NO:413), Figure 297 (SEQ ID NO:415), Figure 299 (SEQ ID NO:417), Figure 301 (SEQ ID
NO:419), Figure 303 (SEQ ID NO:421) and Figure 305 (SEQ ID NO:423).

23. An isolated extracellular domain of a PRO polypeptide.

24. An isolated PRO polypeptide lacking its associated signal peptide. 

25. An isolated polypeptide having at least about 80% amino acid sequence identity to an
extracellular domain of a PRO polypeptide.

26. An isolated polypeptide having at least about 80% amino acid sequence identity to a PRO 
polypeptide lacking its associated signal peptide.

FIGURE 1



CGGACGCGTGGGTGCGAGGCGAAGGTGACCGGGGACCGAGCATTTCAGATCTGCTCGGTAGA 

CCTGGTGCACC^CC^CCA^TTGGCTGCAAGGCTGGTGTGTCTCCGGACACTACCTTCTAGG 

GTTTTCCACCCAGCTTTCACCAAGGCCrCCCCTGTTGTGAAGAATTCCATCAOSAAG^ 

ATGGCTGTTAACACCTAGCAGGGAATATGCCACCAAAAC^GAATTG<^T-CCGGCGTGGGA 

GAACTGGCCAAGAACTCy^GAGGCAGCATTCGAACCATCGATGGAAAAAATATTTAAAATO 

GATCAGATGGGAAGATGGTTTGTTGCTGGAGGGGCTGCTGTTGG^ 

CTATGGCTTGGGACTGTCTAATGAGATTGGAGCTATTGAAAAGGCTGTAATTTGGCCTCAGT 

ATGTCAAGGATAGAATTCATTCCACCTATATGTACTTAGCAGGQAGTATTGGTTTAACM 

TTGTCTGCC^TAGCAATCAGCAGAACGCCTGTTCTCATGAACTTCATGATGAGAGGCTC^ 

GGTGACAATTGGTGTGACCTTTGCAGCCATGGTTGGAGCTGGAATGCrGGTACGATCAATAC 

CATATGACC^GAGCCCAGGCCCAAAGCATCTTGCTTGGTTGCTAC^TTCTGGTGTGATGGGT 

GCAGTGGTGGCTCCTCTGACAATATTAGGGGGTCCTCTTCTCATCAGAGCrcCATGGTACA 

AGCTGGC^TTGTGGGAGGCCTCTCCACTGTGGCCATGTGTGCGCCC^TGAAAAGTTTCTGA 

ACATGGGTGCACCCCTGGGAGTGG<KXrrGGGTCT 

TTTCTTCCACCTACCACCGTGGCTGGTGCCACTCTTTACTCAGTGGCT^TGTACX^ 
AGTTCTTTTCAGCATGTTCCTTCTGTATGATACCCAGAAAGTAATCAAGCGTGCAGAAGTAT 
CACCAATGTATGGAGTTCAAAAATATGATCCCATTAACTCGATGCTGAGTATCTACATGGAT 
-ACATTAAATATATTTATCCGAGTTGCAACTATGCTGGCAACT 

AAGTGACTCAGCTTCTGGCTTCTCTGCTACAT<yVAATATCITGTTO 

ATTAAATAGTTTGTACAAGCAGCTTTCGTTGAAGTTTAGAAGATAAGAAACATGTCATCATA 

TTTAAATGTTCCGGTAATGTGATGCCTCAGGTCTGCCTTTTTTTCTGGAGAATAAATGCAGT 

AATCCTCTCCCAAATAAGCAC^CACATTTTCAATTCTCATGTTTGAGTGATTTTAAAATGTT 

TTGGTGAATGTGAAAACTAAAGTTTGTGTCATGAGAATGTAAGTCTTTTTTCTACTTTAAAA 
TTTAGTAGGTTCACTGAGTAACTAAAATTTAGCAAACCT 

GCAGAATATTGTAATTAATGTCATAAGTGATTTGGAGCTTTGGTAAAGGGACCAGAGAGAAG 

GAGTCACCTGCAGTCTTTTGTTTTTTTAAATACTTAGAACTTAGCACTTGTGTTATTGATTA 

GTGAGGAGCXZAGTAAGAAACATCTGGGTATTTGGAAACAAGTGGTCATTGTTACATTCATTT 

GCTGAACTTAACAAAACTGTTCATCCTGAAACAGGCACAGGTGATGCATTCrCCTGCTGTTG 

CTTCTCAGTGCTCTCTTTCCAATATAGATGTGGTCATGTTTGAGTTGTACAGAATGTTAATC 

ATACAGAGAATCCTTGATGGAATTATATATGTGTGTTTTACTTTTGAATGTTACAAAAGGAA 

ATAACTTTAAAACTATTCTCAAGAGAAAATATTCAAAGCATCAAATATGTTGC^^ 
AATACAAACAGTATACTCATG

FIGURE 2

MLAARLVCLRTLPSRVFHPAFTKASPVVKNSITKNQWLLTPSREYATKTRIGIRRGRTGQEL 
KEAALEPS^KIFKIIX2MGRWFVAGGAAVGI^ALOTYGLGLSNEIGAIEKAVIWPQYVKDRI 
HSTYfTTIiAGSIGLTALSAIAISRTPVLMNFMMRGSWVTIGVTFAAMVGAGMLVRSIPYDQSP 

GPKHLAWLLHSGVMGAVVAPLTIIjGGPLLI RAAWYTAG I VGGLSTVAMCAPSEKFIiNMGAPL 
GVGLGLVFVSSLGSMFLPPTTVAGATLYSVAMYGGLVLFSMFLLYDTQKVIKRAEVSPMYGV 

QKYDPINSMLSIYMDTLNIFMRVATMLATGGNRKK

FIGURE 3

GAAGGCTGCCTCGCTGK3TCC<3AATTCGG 

CGCGK3CTTCGGCGGCTTCCACCTAGACACCTAACAGTCGCGGAGCCGG 

GGTCGGCACGGGGAGTCGGGCGGTCITGTGCATCTTGGCTACCTGTGGGTCGAA 

ACATCGGAGACTGGTTCAGGAGCATCCCGGCGATCACGCGCTATTGGTTCCSCCGCCA^ 

GCCGTGCCCTTGGTCGGC&AACTCGGCCT 

AGCCTTCCTTTATCGCTTTCAGATTTGGAGGC 

GTCCAGGAACTGG ATT TCTTTATTTGGTCAATTT^ 

CTTGAAACAGGAGCTTTTGATGGGAGGCCAGCAGACT 

GATTTGCATCGTGATTACTGGCTTAGCAATGGATAT^ 

TGTGAGTACTTTATGTCTGGGCCGAGCT 

ACACGATTTAAGGCCTGCTATTTACCCTGGGTTATC 

CrrCGGTAATCAATGAGCTTATTGGAAATCTGGTTGGACATCTTTATT 

GATACCCAATGGACTTGGGAGGAAGAAATTTTCTATCG^ 

CTGCCCAGTAGGAGAGGAGGAGTATCAGGATTTGGTGTGCCCCCTGCTAGCATGAGGCGAGC 
TGCTGATCAGAATGGCGGAGGCGGGAGACACAACT 
AGTG^GGGGCGGCCTCGGGCAGCCGCTCCT 
CACITAACAACTGCGTTCTGGCTAACACTGTTC 

AGTACGAGACAA AGTT TCTTAAATCCCGAAGAAAAATATAAGTGTTCCACAAGTTTCACGAT 

TCTCATTCAAGTCCTTACTGCroTGAAGAACAAATACCAACTGTG 

TACATTTTTTGGTGTCTTCTCTTCTCCCCTTTCCGTCTGAATAATGGGTTTTAG 

AATCTGCT<MC^TTGAGCTGGGGCTGGGTCACCAAACCCTTCCCAAAAGGA 

TCTTGCACACATGCCTCTCTCX:CACTTTTCCCAACCCCCAC^T^ 

CCCATAAAATTGCTCTGCCCTTGACAGGTTCTGTTATTTATTGACTTTTGCCAA 

ACAACAATCATATTCACGTTATTTTCCCCTTTTGGTGGCAGAAOT 

AAGA CAGC CACX3GATGAAGCGTTTCTCAGCTTTTGGAA 

AACCGTTTGCCACTCTTCAGATATTTTTTATAAA 

CAGATTGGTTATTAATGAGATACGAGGGTTGGTGCTGGG 

TCAAGACTGTAGTGGAGTTGCAGCTAACATGGGTTAGGTTTAAACCATGGGGGATGCACCCC 

TTTGCX3TTTCATATGTAGCCCTACTGGCTTTGTGTAGCTGGAGTAGTTGGGTTGCTTTC 

TAGGAGGATCCAGATCATGTTGGCTACAGGGAGATGCTCTCTTTGAGAGGTCCTGGGCATTG 

ATTCCCATTTCAATCTCATTCTGGATATGTGT^ 

-CGCTATTTAAATGTCACTIT-T-T-TC 

GAGGAAGGCGCAGCTCCTGTCTGCACGTAGATCATTTTTTAAAGCTAATGTAAG 

AGGGAATAACATGATTTAAGGTTGAAATGGCTTTAGAATCATTTGGGTTTGAGGGTGTGTTA 

TTTTGAGTCATGAATCTACAAGCTCTGTGAATCAGACCTVGCT 

TCGTAGGTGGGCTTTTCCTATCAGAGCTTGG 

TGGCTTTTCACACAGTTATTTTATTTTATGACGTTATCTGAAAG 

ATTGAGTGGCTGTCACACTTTGAGGCAACTAAAAAGGCTTCAA^ 

TTCAGGAAACATTGTCCTCTAACAGTATGACTATTC 

GTGTGTTATCCTAGGAAATGAGAGTTGGCAAACAACTTCT 

TACTTCTCCATATTTAATTTATATGATAAAATAGGTC 

TGTTTTGTTGTTC^TCTGTGGCCACAATAAAGTTTACTTGTAAAATTTTA^ 

CCAATTATGTTGCACGTACACTCATTGTACAGGCGTGGAGACTCATTGTATGTATAAG 

TTTCTGAC^TGAGTGACCCGGAGTCTCTGGTGTAC^ 

CAGTCATTTTTTCCTAAAGGTTTACi^ 

ATGAAGTTATTCCTCTTAAACATGGTTAGGAAGCTG^ 

ATGTTTCTGGAATAATTTTACCAAAACAAGCT 

TGACAGTGGATTCTCTTTACAAATGGAAAAAAAAAATCCTTATTTTGTATAA 
TTTTTGTAAACTAATCCTTTTTATTGGTAAAAATTGTAAATTAAAATGTG

FIGURE 4

MSDIGDWFRSIPAITRYWFAATVAVPLVGKLGLISPAYLFLWPEAFLYRFQIWRPITATFYF 
PVGPGTCFLYLVNLYFLYQYSTRLETGAFDGRPADYLFMLLFIWICIVITGIjAMDMQLLiMIP 
LIMSVLYVWAQI^MIVSFWFGTRFKACYLPWVILGFNYIIGGSVINELIGNLVGHLYFFL 

MFRYPMDLGGRNFIjSTPQFTjYRWLPSRRGGVSGFGVPPASMRR^

FIGURE 5

GGGGCCGCGGTCTAGGGCGGCTACGTGTGTTC 

GTAGCTTCTATCCTGGGGGCTGAGCGACTGCGGGCCAGCT 

CCTTGTGGCCCAAAGGCCTAACCGGGGTCCGGQ3GTCTGGCCTAGG 

CCITTGGGGCGGGAT^CTGCGGAAGAAGAAGACGAGGTGGAG 

CGGGGTTCCTGCGAGGCCCAGACTGGTCCATCCCCATCTT 

GAAGTTAACTGCAAAGGAGGGCATGTGATAACTC 

GGCCTGTGTTCCCCTTGTTT 

ATCAGGAATACAAAGAACTAGTTGAAAAGCTGTTAG 

AATGAAGATCAATTTCAAGAAGCATGCACrTCTCCTCT 

CATTTTGCAACCTGTGTTGGCAGCAGAAGATTTTACT^^ 

AAAACATTGAAATGCAGCnX5CAAGCCATTC 

GACTGCTTAACCGATGGCTCTGATGTXMTCAGTGACCTTGAA 

GAGGGAAGTTCTTAGAAAATCAAAAGAGGAATATGACC^^ 

AACAGTTATCAGAGGCTAAAACAGAAGAGCCCACAGTGCATTCCAG 

AATAATTCCCAAGGGGATGGTGAACATTTTGCAC^ 

TGCTAATCAGTCAATAGAACCTTTGGGAAGAAAAGTGGAAAGGTCTGAAACTTCCTCCCTCC 

CACAAAAAGGCCTGAAGATTCCTGGCTTAGAGCATGOjAGCATTGAAGG 

TTATCAGTACTTGGAACAGAAGAACTTCGGCAACGAGAAC^ 

TAAGTTGATGTCCATGAGAAAGGATATGAGGACTAAACAGATACAAAATATGGA<3CAGA 

GAAAACCCACTGGGGAGGTAGAGGAAATGACAGAGAAACCAGAAATGACAG 

GAAACATTACTAAAGAGGAGATTGCTTGCAGAGAAACTCAAAGAAGAAGTTA 

ATAATTAAGAACAATTTAACAAAATGGAAGTTCAAATTGTCTTAA 

CTTACACTG

FIGURE 6

MAAEEEDEVEWVVESIMFLRGPDWSIPILDFVEQKCEWCKGGHVITPGSPEPVILVACVP 
LVFDDEEESKLTYTEIHQEYKELVEKLLEGYLKEIGINEDQFQEACTSPLAKTHTSQAILQP 
VLAAEDFTIFKAMMVQKNIEMQLQAIRIIQERNGVLPDCLTDGSDWSDLEHEEMKILREVL 
RKSKEEYDQEEERKRKKQLSEAKTEEPTVHSSEAAIMNNSQGDGEHFAHPPSEVKMHFANQS 
IEPLGRKVERSETSSLPQKGLKIPGLEHASIEGPIANLSVLGTEELRQREHYLKQKRDKLMS 
MRKDMRTKQIQN^QKGKPTGEVEEMTEKPEMTAEEKQTLLKRRLIAEKLKEEVIN^

FIGURE 7

GGGCACAGCACATGTGAAGTTTTTGATGATGAAGAAGAAAGCAAATTGACOT 

TCATCAGGAATACAAAGAACTAGTTGAAAAGCTGTT^ 

TTAATGAAGATCAATTTCAAGAAGCATGCACTTCTCCTCTTG 

GCCATTTTTGCAACCTGTGTTGGCAGCA<3AAGATTT^ 

AGAAAAACATTGAAATGCAGCTGCAAGCCATT 

CCTGACTGCTTAACCGATGGCTCTGATC^ 

CCTGAGGGAAGTTCTTAGAAAATCAAAAGAGGAATATGACGAGGAA

FIGURE 8



3TTCTGCAATAGGCGGCTTAGAGGGAGGGGCTTTTTCGCCTATACCTACTG 

fc CTGCTGCTAC] 
lGAAGCTCTTC] 
ICTGTAATATGC 
^AGCTCAAGTQ 

ctgc^^gaaaacaagtatga^^ 



.TGTCCGT^TTCCACTAGAAGCTCTTCTGAGGGAGGTAATTAAAAAAC 

XTGTAATATGCTCCTTGTCAACAATGTATAC 

A ttcctg^a<*tgccata™^^ 



TAGCTTCTCCACGTATGGACCCTAAAGGCTAC 



GCTCTTATCTTCTCAAATTTTAGCATTATAACAACAGCTCTT^ 

TTTTTCAGCCCTTCCAATTCCTGC 
TACAGCAAAGGAATGGACT^^^ 



&raTrCGTCTTGGCATGGGCCATGTTCTTATTAi^i^<-««x«* 



CATGTTCCATGTCTTGATGGCCCA" 



jGGTTACCACTGTCATTATCACAACAGTGTCTGTCCTGG 



TCTTTGACTTCAGGCCCTO 



CATGTTCC 
TCTTTGAC 
TTTATTTi 
AGATCTAi 
CCAAACCL- 



'ATTTATAATGCCAGCAAGCCTCAAGTTCCGG. 



CCTGGAATTTTTCTTGGAAGCCCCATCAGTCCTTCTCTCTATA 
CCTGGAATTTT. j^taCGCACCTAGGCAAGAAAGGATCCG 



AGATCTAAGTGGCAATCTTTGGGAGCGTTCCAt 



.GTGGGGATGGAGAAGAACTAGAAAGACTTA 



ICCAAGAGTGATGAGTCAGATGAAGATACTTT 



CTAACTGGTACCCACATAGTTTGCA 



TTGTAATATTTATCTTTTCACTTTGATA 



CTTCCCTTTTTAACATTATAAAAGCTJ 



XSCTTTTTTCTCCTCAGTTTi 
["TTGGAGAATAAGAGGGCAT 
CCTTTCCAAGAGTTGAAATG 



Vr^TG<ATGCAGCCAGTTAACrCTCG 
\TTAGCAAACAAAAGTG 
rTAACTGGCAGGCCTGl 
TTCTCAGAAACTTATAC 
AGGTAATATACTATTATA1 



GTGGAAATTTGTAATTAAAATAATTATTAAACCT 



WO 99/63088 PCT/US99/12252 

FIGURE 9 

MEKQCCSHPVICSLSTMYTFIJ^IFI^ 

LVFCVLVS FCVI KKDHQSRNLKYASWKE FS D FMKWS I PAFL YFLDNL I VFYVLS YLQ PAMAV 
I FSNFS 1 1 TTALLFRI VLKRRLNW I Q WASLLTLFLS IVALTAGTKTLQHNLAGRGFHHDAFF 
SPSNSCLLFRSECPRKDNCTAKEWTFPEAKWNTTARVFSHIRLGMGHVLIIVQ 
YNEKILKEGNQLTESIFIQNSKLYFFGILFITCLTLGLQRS 

FVTAFQGLSVAFILKFLDNMFHVLMAQVTTVI ITTVSVLVFDFRPSLEFFLEAPSVLLS I FI 
YNASKPQVPEYAPRQERIRDLSGNLWERSSGDGEELERLTKPKSDESDEDTF

FIGURE 10

CGTGCCTGCGCAATGGGTGTCGGGTCCGCTTTTTCCCAATCCGGACGTAATCGTGGTTTTTG 

TTCTGCAATAGQCGGCTTAGAGGGAGGGGC^TTTTCGCCrATACCTACTGTAGCTTCT 

GTATGGACCCTAAAGGCTACTGCTQCTACTACGGGGCTAGACAGTTACTGTCTCA^ 

GATGTGCGTTCTTCCACTAGAAGCTCTTCTGAGGGAGGTAATTAAAAAACAGTGGAATGGA^ 

AAACAGTGCTGTAGTCATCCTGTAATATGCTCCTTGTC^C^TGTATACATTCCTGC^^ 

TGCCATATTCATTGCTTTAAGCTCAAGTCGCATCTTACTAGTGAAGTATTCTG^ 
AAAACAAGTATGATTATCTTCCAACTACTGTGAAT(nX3TGCT^ 

TTCTGTGTGCTTGTGTCATTCTGTGTTATAAAC^GATCATCAAAGTAGAAATTTGAAATA 

TGCTTCCTGGAAGGAATTCTCTGATTTCATGAAGTGGTCCATTCCTGCCTTTCTTT^ 

TG^TAACTTGATTGTCTTCTATGTCCTGTCCTATCTTCAACCAGCC^T^ 

TCAAATTTTAGCATTATAACAACAGCTCTTCTATTCAGGATAGTGCTGAAC^GG 

CTGGATCCAGTGGGCTTCCCTCCTGACTTTATTTTTGTCTATTGTGGCCTTGACTGCCGGX^ 
CTAAAACTTTA 



FIGURE 11 

CGGACGCGTGGGCGGAOSCGTGGGCGGACGCGTGG 

GTGGCTAAGGCTGCTACGAAGCGAGCTTGGGAGGAGCAGCGGCCTGC^ 

CCCGTCTACCAGGTCCCAAGCGGCGTGGCC<!X3^ 

AGCGGCTCOSCGGCGGGGCTGCTACCCACCA^ 

GGTGAAGAAAGAACCGAAAAAGAAGAAACAACAGTTGTCTG 

C^CTTGGGGGAGCCCCCTACCAGGTCACG^ 

CTATTGQaiGTGGCTCAGGTGGGCCCTTTCTCroCCTCCATCATC 
CTGGGATGCCATCACAGACCCCCTGGTGGGCCTCTGCATCAGCA 
TGGGTCGCCTTATGCCCTGGATCATCTTCT 
ATCTGGTTOSTGCCCGACTTCC^ 

CTTTGAAACAATGGTCACGTGTTTCCATGTTCCCTACTOGGCTCT 
ACCGAGCAGACTGAGCGGGATTCTGCCACCGCCTATCGGATGACTGTGGA^ 
AGTGCTGGGCACGGCGATCCAGGGACAAATCGTGGGCCAAGCAGACACGCCTTC 
ACTTCAATAGCTCTACAGTAGCTTC^^ 

AGGGAAACGCAAAAGGCATACCTGCTGGCAGCGGGGGTCATTGTCTO 
TGCTGTCATCCTGATCCTGGGCGTCCGGGA^ 
AGCCAATCGCCTACTTCCGGGGCCTAC^ 
ATTACTGGCTTCCTCTTCACCT^ 

TTGC^CCTAC^C€TTGGGCTTCC<3CAATGAATTCCAGAA^ 

CGGCCACTTTAACCATTCCCATCTGGCAGTGG 

GTATATGTTGGGATCTCATCAGCAGTGCCATTTC 

CCTC^TCATTACATATGCGGTAGCTGTGGCAGCTGGCATCAGTGTGGCAG 

TACCCTGGTCCATGCTGCCTGATGTCATTGACGACTTCCATCTGAAGCAGCCCCACTTC 

GGAACCGAGCCCATCTTCTTCTCCTTCTATGTCTTCTTCACCAAGTTT 

ACTGGGK!ATTTCTACCCTCAGTCTGGACTTTG<^ 

CGGAA<X3TGTCAAGT-T^ 

CTGGGCCTGCTGCTCTTCAAAATGTA^ 

GGCCCTGCA<MCACTGAGGGACGAGGCCAGCAGCT 

AGCTGGCTAGCATCCTCTAGGGCCra^ 

AAGGGATCAGGACCTGTCTGCCX^CTTCCTGAGCAGCTGGACTGCAGGTGCTA 

CTGAAGACTCAAGGAGGTGGCCCAGGACACTTGCTGTC 

TGGCCTCCTGCCTCCCCTCTGCCTGCCroTGGGGCCAA^ 

TGCCAAGGACTGATCGGGCCTAGCCCGGAACACTAATGTAGAAACCTTTTTO 

TAATTAATAACTTAATCACTGTGTACATAGCAATGTGTGTGTATGTATATGTCTC 

TTAATGTTATTAATTTTCATAAAAGCTGGAAAGC

FIGURE 12

MWLRWALSLPPSSCLWAEPGMPSQTPWWASASANPPGPAWVALCPGSSSPRPWPSLPTSSSG 
SCPTSHTARPIGTCFSIASLKQWSRVSMFPTRLSPCSSATEQTEIU3SATAYRMTVEVLGTVL 
GTAIQGQIVGQADTPCFQDFNSSTVASQSANHTHGTTSHRETQKAYLLAAGVIVCIYIICAV 
ILILGVREQREPYJIAQQSEPIAYFRGLRLVMSHGPYIKLITGFLFTSLAFMLVEGNFVLFCT 
YTLGFRNEFQNLLIAIMLSATLTIPIWQVWLTRFGKKTAVYVGISSAVPFLILVAIiMESNLI 
ITYAVAVAAGISVAAAFLLPWSMLPDVIDDFHLKQPHFHGTEPIFFSFYVFFTKFASGVSLG 
ISTLSLDFAGYQTRGCSQPERVKFTLNMLVTMAPIVLILLGLIjLFKMYPIDEERRRQNKKAL 

QALRDEASSSGCSETDSTELASIL

FIGURE 13

GGGAAACGCAAAAGGCATACCTGCTGGCAGCGGGGGTCATTGTCTC 

GCTGTCATCCTGATCCTGGGCGTGCGGGAGCAGAGA 

GCCAATCGCCTACTTCCGGGGCCTACGG 

TTACTGGCTTCCTCTTCACCTCCTTGGCTTTCATGCTGG 

TGCACCTACACCTTGGGCTTCCGCAATGAAT^ 

GGCCACTTTAACCATTCCCATCTGGCA^ 

TATATGTTGGGATCTCATCAGCAGTGCCATTTCT 

CTCATCATTACATATGCGGTAGCTGTGGCAGCTGGCA^ 

ACCCTGGTCCATGCTGCCTGATGT^ 

GAACCGAGCCCAT

FIGURE 14

CMC^CTTCGGCGCCAGCGGCCAGCGCT^ 

ATGAGCAGGTCTGAAGACTAACATTTTGTCAAGTTGTAAAACAGAAAACCTGTTAGAA^IST 

GGTGGTTTCAGCAAGGCCTCAGTTTCCTTCCTTCAGCCC^ 

TTCATATTTTCATACATTACTGCAGTAA^ 

CAGTGACACT(3GTACAGTAGCTCCAGAAAAATGCTTATTTGGGGCA^^ 
CAGTTTTATGCATTGCTACCATTTATGTTCGTTATAAGCAAGTTCATGCT 
GAGAACGTTATCATCAAATTAAACAAGGCTGGCCTTGTACTTGGAATAC^ 
ACTTTCTATTGTGGCAAACTTCCAGAAAACAACCCTTTTTGCTGCA^ 

TGCTTACCTTTGGTATGGGCTCA^^ 

CAGCCCAAAATCCATGGCAAACAAGTCTTCTGGA^ 

AGTAAGTGCACTTAGCATGCTGACTTGCTCATCAGTTTTGCAC^OTC^ 

ATTTAGAACAGAAACTCCATTG^ 

ACTGCAGCAGAATGGTCTATGTCATTTTCCTTCTTTGGTTTTTTCCTGACTTA 
TTTTCAGAAAATTTCTTTACGGGTGGAAGCCAATTTACATGGATTAACCCTCTATGAC^ 
CAC CTTGCCCTATTAACAATGAACGAACACGG CT ACTTTC GAGAGATATTTGATGAAAGGAT 
AAAATATTTCTGTAATGATTATGATTCTCAGGGATTGGGGAAAGGTTC^^ 

TTCTTCTCTGAAATTTT^ 

ATCAGGAAACATGAAAGAAGCCATTTGATAGATTATTCTAAAGGATATC^TCAAGAAGACTA 
TTAAAAACACCTATGCCTATACTTTTTTATCTCAGAAAATAAAGTCAAAAGACT

FIGURE 15

MWWQQGLSFLPSALVIVITSAAFIFSYITAVTLHHIDPALPYISDTGTVAPEKCLFGAMLNI 
AAVLC I ATI YVRYKQVHALSPEENVI I KLNKAGLVLG I LS CLGLS IVANFQKTTLFAAHVSG 
AVLTFGMGS LYMFVQT ILS YQMQ P KI HGKQVFWI RLLL VI WCGVS ALSMLTCS S VLHSGNFG 
TDLEQKLHWNPEDKGYVLHMITTAAEWSMSFSFF6FFLTYIRDFQKISLRVEANLHGLTLYD 
TAPCPINNERTRLLSRDI

FIGURE 16

CGGACGCTTGGGCNGCGCCAGCGGCCAGCGCTAGTCGGTCTGGTAAGTGCCTGATGCCGAGT 
TCCGTCrrcrCGGGTCTTTTCCTGGTCCCAGGCAAAGCGGAGCGGAGATCCTCAAACGGCCTA 
GTGCTTCGCGCTTCCGGAGAAAATCAGCGGTCTAATTAATT^ 

TACCAAGAATCTTCAACCCTTTCCCACAAAAGCTAATTGAGTACACGTTCCTGTTGAGTACA 

CGTTCCTGTTGATTTACAAAAGGTGCAGGTATGAGCAGGTCTGAAGACrAACATTTT^ 

GTTGTAAAAC^GAAAACCTGTTAGAAATGTGGTGGTTTCAGCAAGGCOTCAGTTTCCTTCCT 

TCAGCCCTTGTAATTTGGACATCTGCTGCTTTCATATTTTCATACATTACT6CAGTAACACT 

CCACCATATAGACCCGGCITTACCTTATATCAGTGACACTGGTACAGTANC

FIGURE 17

CCC^CGCGTCCGCCO^CGCTGCGTCCCGGAGTGCAAGTGAGCTTCTCGGCTGCCCCGCGGG 
CCGGGGTGCGGAGCCGACMSCGCCCGCTTCTCGGCCTCCTTCTGGTCTTCGCCGGCTGCAC 
CTTCGCCTTGTACITGCTGTCGACGCGACTGCCCCGCGGGCGGAGACTGGGCTCCACCGAG^ 
AGGCTGGAGGCAGGTCX3CTGTGGTTCCCCTCCGACCT 

GTCCTTCGAGAGTACCGGAAGGAGCACCAGGCCTACGTGTTCCTGCTCTTCrrGCGGCXXICTA 

CCTCTACAAACAGGGCITTGCCATCCCCGGCTCCAGCT 

TGTTTGGGCCATGGCrGGGGCTTCrGCTGTGCTGTGTGTTGACCr 

TGCTACCTGCTCTCCAGTATTTTTGGCAAACAGTTGGTGGTGTCCTACTTTCCTGATAAAGT 

GGCCCTGCTGCAGAGAAAGGTGGAGGAGAACAGAAACAGCITGTTTTTTT^ 

TGAGACTTTTCCCGATGAC^CCAAACTGGTTCTTGA 

CCCATCGTGCAGTTCTTCTTCTCAGTTCTTATCXMTTTGATCCCATATAATTTCATCTGTGT 

GCAGACAGGGTCOVTCCTGTCAACCCrrAACCrCT 

TCTTTAAGCD3TTGGCCATTGCCATGGTGGCATTAATTCCTGGAA 

AGTCAGAAACATCTGCAATTGAATGAAACAAGTACTGCTAATCATATACACAGTAGAAAAGA 

PACRTQATCTGGAITITLTGTITGCCAC^^ 
TGTGGTCCTCTAAAGCCCCTCATTGTTTTTGATC 

CATCAATGTGCAOTGTCTTTT*" 1 & & ACtfl AC A OTgTGCTCTTGAAGGTGT ATTACATCAGGT 
TTTCAAACCAGCCCTGGTGTAGCAGACACTGCA^ 

GGCCGGGCGCGGTGGCTCACGCCTGTAATCCCAGCACTTTGGGAGGCCGAGGCCGGTGATTC 

ACAAGGTCAGGAGTTGAAGACC^GCCTGGCCAAGAT<^TGAAATCCTGTCTCTAATAAAAAT 

ACAAAAATTAGCCAGGCGTGGTGGC^GGCACCTGTAATCCCAGCTACTCGGGAGGCTGAGGC 

AGGAGAATTGCTTGAACCAAGGTGGC^GAGGTTGCAGTAAGC 

CCAGCCTGGGTGATAGAGTGAGACACTGTCTTGAC

FIGURE 18

MRPLLGLLLVFAGCTFALYI^STRLPRGRRLGSTEEAGGRSLWFPSDI^LRELSEVLRETO 
KEHQAYVFLLFCGAYLYKQGFAIPGSSFLNVIjAGALFGPWI^LLLCCVLTSVGATCCYLLSS 
IFGKQLWSYFPDKVALLQRKVEENMISLFFFLLFLRLFPMTPNWFLNIiSAPILNIPIVQFF 
FSVLIGLIPYNFICVQTGSILSTLTSLDALFSWDTVFKLLAIAMVALIPGTLIKKFSQKHLQ 

LNETSTANHIHSRKDT

FIGURE 19

CCGAGGCGGGAGGAGC^ 

CCAGTTCTCS^CCGCTCAGTTGTGATCAAGTO 

TAGGAAAATAACTTGGGATTTTATATTCGAAGACaSSGATCTO 

TTTATGACAAACTTTCAGAGACTGTTGATTTGGTGA 

TCAGAGAAGGCAATTGAAAAATTTATCAGACAGCTGCTGGAA 

CCCCCCGCAGTATCCTCTCCTTATAGTTGTGTA^ 

TGCTCACTGCCTACTTTGTGATTCAACCTTTCAGCCCAT^ 

GGAGCTCACACCTGGCGCTCACTCATCCATC1ACATTAGGCT 

GAAGTACATGTCAGAAAATAAGGGAGTTCCTCTGCATGGGGGTGATGAAGACAGA 

CAGACTTTGACCCCTGGTGGACAAACGACTGTGAGCAGAATGAGTCAGAGC 

AACTGCACTGGCTGTGCCCAGAAAC^ 

ATTTGAGAGGCTCCATCCACTGGTGATCAAGACGGGAAAGCTC 

AGCATTTTTTGTGCCAGTACCCTGAGGCGACAGA^ 

TGGTGGCGCTGCTTTCCTGAGCGGTGGTTCCCATTTCCTO 

CAGATCACAAATGTTACGTGAGCTTTTTCCTGTTT^ 

CCTCTTTAAACAAGTGCTCCTTTCTTCA 

ATGCCTGACCTATTTATCATTCXjC^ 

GTGCCGAAGACATTGTCAGTCTGTGG<^ATGCCAATAGAGCCAGGGGATATCX^ 
ACACCACCCACTGGAAGGTCTACGTTATAGCCAGAGGGGTCCAGCCTTTGG 
GGAACCGCTTTCTCAGAACTGE^^AAATAGAAC 
AAACCAGGTTGAAAGGGGAAAAATAAAAACAAAAACGATG

FIGURE 20

MDLAANEISIYDKLSETVDLWQTGHQCGMSEKAIEKPIRQLLEKNEPQRPPPQYPLLIVVY 

KVIATLGLILLTAYFVIQPPSPL^EPVLSGAHTWRSLIHHIRLMSLPIAK^ 
HGGDEDRPFPDFDPVWTNDCEQNESEPIPANCTGCAQKHLKVMLLEDAPRKFERLHPLVIKT 

QKPIiLEEEIQHFLCQYPEATEGFSEGPFAKWWRCPPERWPPPPYPWRRPIJlRSQMLRELPPV 
FTHLPFPKDASLNKCSFLHPEPWGSKMHKMPDLFIIGSGEAMLQLIPPFQCRRHC©SVAMP 
IEPGDIGYVDTTHWKVYVIARGVQPLVICDGTAFSEL

FIGURE 21

CC^CGGTGTCX:GTTCTTCX3CCCGGCGGCAGCTGTCCCCGAGGCGGGAGGAGCCCGAGGGGCG 
CGAGCCCCGCATGAATCATTGTAGTCAATCATTTTCCAGTTCTCAGCCGTTCAGTTGTGATC 

AAGGGACACGTGGTTTCCGAACTGCCAGCTCAGAATAGGAAAATAACT 
GGAAGACATGGATCTTGCTGCK^CGAGATCAGCATTTATG^ 
ATTTGGTGAGACAGACCGGCCATCAGT6TGGCATGTCAGAGAAGGCAATTGAAAAA 
AGACAGCTGCTGGAAAAGAATGAACCTCAGAGACCCCCCCCGCAGTATCCTCTCCTTATAGT 

TGTGTATAAGGTTCTC^CAACCTTGGGATTAATCTTGCTCACTGCCTACT 
CTTTCAGCCCATTAGCACCTGAGCCAGTGCTTTGTGGAGCTCAC

FIGURE 22

CCCACGCGTCCGCCCACGCGTCCGGCTGAACACCTCTTCTTTGGAGTCAGCCACTGATGAGG
^GGGTCCCCACTTGCAGCTGCAGCAGCTGCAGCAGCTGCAGAGCGCTGCTCCTGGCTGGTG 
CCACTGGTGCGCACGCTGCTAGACCGTGCCTATGAGCCGCTGGGGCTGCAGTGGGGA 
CTCCCTGCCACCCACCAATGGCAGCCCCACCTTCTTTGAAGACITCCAGGCTTTTTGTGCCA 

CACCCGAATGGCGCCACTTCATCGACAAACAGGTACAGCCAACCAIGTCCCAGTTCGAAATG 

C^CACGTATGCTAAC^GCC^CGACCTTATGTCAGGTTTCTGGAATGCCTGC^TGACATGCT 

TATGAGCAGTGGGCAGCGGCGCCAGTGGGAGCGCGCCCAGAGT03TCGGGCCTTCCAGGAGC 

TGGTGCTGGAACCTGCGCAGAGGCGGGCGCGCCTGGAGGGGCTACGCTACACGGCAGTGCTG 

AAGCAGCAGGCAACGCAGCACTCCATGGCCCTGCTGCACTGGGGGGOGCTGTGGCGCCAGCT 

CGCCAGCCCATGTGGGGCCTGGGCGCTGAGGGACACTCCCATCCCCCGCTGGAAACTGTCCA 

GCGCCGAGAC^TATTCACGCATGCGTCTGAAGCTGGTGCCCAACCATCACITCGACCCTCAC 

CTGGAAGCC^GCGCTCTCCGAGACAATCTGGGTGAGGTTCCCCTGAC^CCCACCG^AGGAGGC 

CTCACTGCCTCTGGCAGTGACC^AAGAGGCCAAAGTGAGCACCCCACCCGAGTTGCTGCAGG 

AGGACCAGCTCGGCGAGGACGAGCTGGOTGAGCTGGAGACCCCGATGGAGGGAGCAGAACTG 

GATGAGCAGCGTGAGAAGCTGGTGCTGTCGGCCGAGTGCCAGCTGGTGACGGTAGTGGCCGT 

GGTCCCAGGGCTGCTGGAGGT(^CCACAC^GAATGTATACTTCTACGATGGCAG^CTGAGC 

GCGTGGAAACCGAGGAGGGCATCGGCTATGATTTCCGGCGCCCACTGGCCCAGCTGCGTGAG 

GTCCACCTGCGGCGTTTCAACCTGCGCCGTTCAGCACTTGAGCTCTTCTTTATCGATCAGGC 

CAACTACTTCCTCAACTTCCCATGCAAGGTGGGCACGACCCCAGTCTCATCTCCTAGCCAGA 

CTCCGAGACCCCAGCCTGGCCCCATCCCACCCCATACCCAGGTACGGAACCAGGTGTACTCG 

TGGCTCCTGCGCCTACGGCCCCCCTCTCAAGGCTACCTAAGCAGCCGCTCCCCCCAGGAGAT 

GCTGCGTGCCTCAGGCCTTACCCAGAAATGGGTACAGCGTGAGATATCCAACTTOTAGTACT 

TGATGCAACTCAACACCATTGCGGGGCGGACCTACAATGACCTGTCTCAGTACCCTGTGTTC 

CCCTGGGTCCTGCAGGACTACGTGTCCCCAACCCTGGACCTCAGCAACCCAGCCGTCTTCCG 

GGACCTGTCTAAGCCCATCX3GTGTGGTGAACCCCAAGCATGCCCAGCTCGTGAGGGAGAAGT 

ATGAAAGCTTTGAGGACCCAGCAGGGACCATTGACAAGTTCCACTATGGCACCCACTACTCC 

AATGCAGCAGGCGTGATGCACTACCTCATCCGCGTGGAGCCCTTCACCTCCCTGCAC^GTCCA 

GCTGCAAAGTGGCCGCTITC^CTGCTCCGACCGGCAGTTCCACTCGGTGGCGG^GCCT^C 

AGGCACGCCTGGAGAGCCCTGCCGATGTGAAGGAGCTCATCCCGGAATTCTTCTACTTTCCT 

GACTTCCTGGAGAACCAGAACGGTTTTGACCTGGGCTGTCTCCAGCTGACCAACGAGAAGGT 

AGGCGATGTGGTGCTACCCCCGTGGGCCAGCTCTCCTGAGGACTTCATCCAGCAGCACCGCC 

AGGCTCTGGAGTCGGAGTATGTGTCTGCAC^CCTACACGAGTGGATCGACCTCATCTTTGGC 

TACAAGCAGCGGGGGCCAGCCGCCGAGGAGGCCCTCAATGTCTTCTATTACTGCACCTATGA 

GGGGGCTGTAGACCTGGACCATGTGACAGATGAGCGGGAACGGAAGGCTCTGGAGGGCATTA 

TCAGC^CTTTGGGCAGACTCCCTGTCAGCTGCTGAAGGAGCCACATCCAACTCGGCTCTCA 

GCTGAGGAAGCAGCCC^TCGCCTTGC^CGCCTGGAC^CTAACTCACCTAGCATCTOCAGCA 

CCTGGACGAACTCAAGGCATTCTTCGCAGAGGTGACTGTGAGTGCCAGTGGGCTGCTGGGCA 

CCCACAGCTGGTTGCCCTATGACCG(^ACATAAGCAACTACTTCAGCTTCAGCAAAGACCCC 

ACCATGGGCAGCCACAAGACX3CAGCGACTGCTGAGTGGCCCGTGGGTGCCAGGCAGTGGTGT 

GAGTGGACAAGCACTGGCAGTGGCCCCGGATGGAAAGCTGCTATTCAGCGGTGGCCACTGGG 

ATGGCAGCCTGCGGGTGACTGCACTACCCCGTGGCAAGCTGTTGAGCCAGCTCAGCTGCCAC 

CTTGATGTAGTAACCTGCCTTGCACTGGACACCTGTGGCATCTACCTCATCTCAGGCTCCCG 

GGACACCACGTGCATGGTGTGGCGGCTCCTGCATCAGGGTGGTCTGTC^GTAGGCCTGGCAC 

CAAAGCCTGTGCAGGTCCTGTATGGGCATGGGGCTGCAGTGAGCTGTGTGGCCATCAGCACT 

GAACTTGACATGGCTGTGTCTGGATCTGAGGATGGAACTGTGATCATACACACTGTACGCCG 

CGGACAGTTTGTAGCGGCACTACGGCCTCTGGGTGCCACATTCCCTGGACCTATTTTCCACC 

TGGCATTGGGGTCCGAAGGCCAGATTGTGGTACAGAGCTCAGCGTGGGAACGTCCTGGGGCC 

CAGGTCACCTACTCCTTGCACCTGTATTCAGTCAATGGGAAGTTGCGGGCTTCACTGCCCCT 

GGCAGAGCAGCCTACAGCCCTGACGGTGACAGAGGACTTTGTGTTGCTGGGCACCGCCCAGT 

GCGCCCTGCACATCCTCCAACTAAACACACTGCTCCCGGCCGCGCCTCCCTTGCCCATGAAG 

GTGGCCATCCGCAGCGTGGCCGTGACCAAGGAGCGCAGCCACGTGCTGGTGGGCCTGGAGGA 

TGGCAAGCTCATCGTGGTGGTGGCGGGGCAGCCCTCTGAGGTGCGCAGCAGCCAGTTCX5CGC 

GGAAGCTGTGGCGGTCCTCGCGGCGCATCTCCCAGGTGTCCTCGGGAGAGACGGAATACAAC 

CCTACTGAGGCGCGCTGAACCTGGCCAGTCCGGCTGCTCGGGCCCCGCCCCCGGCAGGCCTG 

gcccgggaggccccg"cccagaagtcggcgggaacaccccggggtgggcagcccagggggtga 

GCGGGGCCCACCCTGCCCAGCTCAGGGATTGGCGGGCGATGTTACCCCCTCAGGGATTGGCG 
GGCGGAAGTCCCGCCCCTCGCCGGCTGAGGGGCCGCCCTGAGGGCC^GC^CrGGCGTCT

FIGURE 23

MSQFEMDTYAKSHDLMSGFWNACTO 

RYTAVLKQQ ATQHSMALLHWGALWRQLAS P CGAWALRDTP I PRWKLSSAETYSRMRLKLVPN 

HHFDPHLEASALRDNLGEVPLTPTEEASL^ 

MEAAELDEQREKLVLSAECQLVTVVAWPGLLEVTTQNV^ 

LAQLREVHLRRFNLRRSALELFFIDQANYFLNFPCKVGTTPVSSPSQ 

RNQVYSWLLRLRPPSQGYLSSRSPQEMLRASGLTQ 

SQYPWPWVLQDYVSPTLDLSNPAVFRDLSKPIGVVNPKHAQLVREKYESFEDPAGTIDK^ 
YGTHYSNAAGVMHYLIRVEPFTSLHVQLQ^ 

EFFYFPDFLENQNGFDLGCLQLTNEKVGDVVLPPWASSPEDFIQQHRQALESEYVS 
IDLI FGYKQRGPAAEEALNVFYYCTYEGAVDLD 

HPTRLS AEEAAHRLARLDTNS PS I FQHLDELKAF FAE VTVS ASGLLGTHS WL P YDRN I SNYF 
SFSKDPTMGSHKTQRLLSGPWPGSGVSGQALAVAPIXjKLLFSGGHWIXj 
SQLSCHLDWTCLALDTCG I YL I SGSRDTTC^WRLLHQGGLS VGLAPKPVQVL YGHGAAVS 
CVAI STELDMAVSGSEDGTVI IHTVRRGQFVAALRPLGATFPGP I FHLALGSEGQI WQSSA 
WERPGAQVTYSLHLYSVNGKLRASLPLAEQPTALTVTEDF 

PPLPMKVAIRSVAVTKERSHVLVGLEDGKLIVVVAGQPSEVRSSQFARKLWRSSRRISQVSS 
GETEYNPTEAR 



FIGURE 24

SSSACGCGTGGGCGGACGCGTGGGGOCTGTGAGAAAGTGCCAATAAATACATCATGCAACCC 

CACGGCCCACCTTGTGAACTCCTCGTGCCCAGGGCTGATGTGCGTCTTCCAGGGCTACTCAT 

CCAAAGGCCTAATCCAACGTTCTGTCTTCAATCTGCAAATCTAT 

TGGACCCTTAACTGGGTACrrGGCCCTGGGCCAATGCGTCCTCGCTGGAGCCTTTGC 

CTACTGGGCCTTCCACAAGCCCCAGGACATCC^ 

GCACACTCCGTTACCACACTGGGTCATTGGCATTTGGAGCCCTCATCCTGACCCTTGTGCAG 

ATAGCCCGGGTCATCTTGGAGTATATTGACCACAAGCTCAGAGGAGTGCAGAACCCTGTAGC 

CCGCTGCATCATGTGCTGTTTCAAGTGCTGCCTCTGGTGTCTGGAAAAATTTATCAAGTTCC 

TAAACCGCAATGCATACATC^TGATCGCCATCTACGGGAAGAATTTCTGTGTCT 

AATGCGTTCATGCTACTCATGCGAAACATTGTCAGGGTGGTCGTCCTGGACAAAGTCACA 

CCTGCTGCTGTTCTTTGGGAAGCTGCTGGTGGTCGGAGGCGT 

TTTTCTCCGGTCGCATCCCGGGGCTGGGTAAAGACTTTAAGAGCCCCCACCTCAACTATTAC 
TGGCTGCCCATCATGACCTCCATCCTGGGGGCCTATGTCATCGCCAGCGGCTTCTTCAGCGT 
TTTCGGCATGTGTGTGGACACGCTCTTCCTCTGCTTCCTGGAAGACCTGGAGCGGAACAACG 
GCTCCCTGGACCGGCCCTACTACATGTCCAAGAGCCTTCTAAAGATTCTGGGCAAGAAGAAC 
GAGGCGCCCCCGGACAACAAGAAGAGGAAGAAGTgftCAGCTCCGGCCCTGATCCAGGACTGC 
ACCCCACCCCCACCGTCC^GCCATCCAACCTCACTTCGCCTTAC^GTCrCCATTTTGTGGT 
AAAAAAAGGTTTTAGGCCAGGCGCCGTGGCTCACGCCTGTAATCCAACACTTTGAGAGGCTG 
AGGCGGGCGGATCACCTGAGTCAGGAGTTCGAGACCAGCCTGGCCAACATGGTGAAACCTCC 
GTCTCTATTAAAAATACAAAAATTAGCCGAGAGTGGTGGCATGCACCTGTCATCCCAGCTAC 
TCGGGAGGCTGAGGCAGGAGAATCGCTTGAACCCGGGAGGCAGAGGTTGCAGTGAGCCGAGA 
TCGCGCCACTGCACTCCAACCTGGGTGACAGACTCTGTCTCCAAAACAAAACAAACAAACAA 

AAAGATTTTATTAAAGATATTTTGTTAACTC 
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FIGURE 25 

RTRGRTRGGCEKVPINTSCNPTAHLW 

WTLNWVLALGQCVLAGAFAS FYWAFHKPQD I PTFPL I S AF IRTLRYHTGSLAFGAL ILTLVQ 

IARVILEYIDHKLRGVQNPVARCIMCCFKCCL^^ 

NAFMLLMRNIWVVVLDKVTDLLLFFGKLLWGGVCVLSFFFFSGR 

WLPIMTSILGAYVIASGFFSVPGMCVOTL^ 

EAPPDNKKRKK 
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FIGURE 26 

GAGTCTTGACCGCCGCCGGGCTCTTGGTACCTCAGCGCGAGCGCCAGGCGTCCGGCCGCCGT 
GGCTA2STTCGTGTCCGATTTCCGCAAAGAGTTCTACGAGGTGGTCCAGAGCCAGAGGGTCC 

TTCTCITCGTGGCCTCGGACGTC^ATGCTCTGTGTGCGTGCAAGATCCTTCAGGCOT 

CAGTGTGACC^CGTGCAATATACGCTGGTTCCAGTTTCTGGGTGGCAAGAACTTGAAACrrGC 

ATTTCTTGAGCATAAAGAACAGTTTCATTATTTTATTCTCATAAACTGTGGAGCTAATGTAG 

ACCTATTGGATATTCTTCAACCTGATGAAGACACTATATTCTTTGTGTGTGACTCCCATAGG 

CCAGTCAATGTCGTCAATGTATACAACGATACCCAGATCAAATTACTCATTAAACAA 

TGACCTTGAAGTTCCCGCCrATGAAGACATCTTCAGGGATGAAGAGGAGGATGAAGAGCATT 

CAGGAAATGAC^GTGATGGGTCAGAGCCITCTGAGAAGCGCACACGGTTA 

GTGGAGCAAACCATGCGGAGGAGGCAGCGGCGAGAGTGGGAGGCCCGGAGAAGAGACATCCT 

CTTTGACTACGAGCAGTATGAATATCATGGGACATCGTCAGCC^TGGTGATGTTTGAGCTGG 

CTTGGATGCTGTCCAAGGACCTGAATGACATGCTGTGGTGGGCCATCGTTGGACTAACAGAC 

CAGTGGGTGCJ^GACAAGATCACTCAAATGAAATACGTGACTGATGTTGGTGTCCTGCAGCG 

CCACGTTTCCCGCCACAACCACCGGAACGAC^ATGAGGAGAACACACTCTCCGTGGACTGCA 

C^CCK^TCTCCTTTGAGTA^ 

AGCCTOTGCAACACCAGCTATACCOCAGCCAGGTTCAAGCTGTGGTCTGTGC^TGGACAGAA 

GCGGCTCCAGGAGTTCCTTGCAGACATGGGTCTTCCCCTGAAGCAGGTGAAGCAGAAGTTCC 

AGGCCATGGACATCTCCTTGAAGGAGAATTTGCGGGAAATGATTGAAGAGTCTGCAAATAAA 

TTTGGGATGAAGGACATGCGCGTGCAGACTTTCAGCATTCATTTTGGGTTCAAC^CAAG 

TCTGGCCAGCGACGTGGTCTTTGCCACCATGTCTTTGATGGAGAGCCCCGAGAAGGATGGCT 

C^GGGACAGATCACTTCATCC^GGCTCTGGACAGCCTCT^ 

taccatggcctggaactcgccaagaa^^ 

CTTTGCACCAACCTCGTCATCTCCCAGGGGCCTTTCCTGTACTGCrCT 

TCCAGATGTC^TGCTGTTCTCrA(3GCCGGCATCCCTAAGCCTGCTCAGCAAACAC 

AGTCCTTTGTGTGTTCGACAAAGAACCGGCGCTGCAAACTGCTGCCCCTGGTGATGGCTGCC 

CCCCTGAGCATGGAGCATGGCACAGTGACCGTGGTGGGCATCCCCCCAGAGACCGACAGCTC 

GGACAGGAAGAACTTTTTTGGGAGGGCGTTTGAGAAGGCAGCGGAAAGCACCAGCTCCCGGA 

TGCTGCACAACCATTTTGACCTCTCAGTAATTGAGCTGAAAGCTGAGGATCGGAGCAAGTTT 

CTGGACGCACTTATTTCCCTCCTGTCCTAGJSAATTTGATTCTTCCAGAATGACC^ 
TATGTAACTGGCTTTCATTTAGATTGTAAGTTATGGACATGATTTGAGATGTAGAAGCCATT 

TTTTATTAAATAAAATGCTTATTTTAGGAAA 
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FIGURE 27 

MPVSDFRKEFYEWQSQRVLLFVASDVDALCACKILQALFQCDHVQYTLVPVSGWQELETAF 
LEHKEQFHYFILINCGANVDLLDIWPDBDTIFFVCDSHRPVNVVNVYNDTQIKLLIKQDDD 
LEVPAYEDIFRDEEEDEEHSGNESIX3SEPSEKRTRLEEEIVEQTMRRRQRREWEARRRDILF 
DYEQYEYHGTSSAMVMFEIAWMLSKDLNTOMLWWAIVGLTDQWVQDKITQhaCYVTO 
VSRHNHRNEDEENTLSVDCTRI SFEYDIJ^VLYQHWSLHDSLCNTSYTAARFKLWSVHGQKR 
LQEFLADMGLPLKQVKQKFQAMDI SLKENLREMI EESANKFGMKDMRVQTFSIHFGFKHKFL 
ASDWF ATMS LMES PE KDGSGTDHF I QALD S LSRSNLDKLYHGLELAKKQLRATQQT I AS CL 
CTNLVISQGPFLYCSLMEGTPDVMLFSRPASLSLLSKHLLKSFVCSTKNRRCKLLPIA7MAAP 
LSMEHGTVTWGIPPETOSSDRKOTFGRAFEKAAESTSSRMLHNHFDLSVIELKAEDRSKFL 
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GTACCTCAGCGCGAGCGCCAGGCGTCCGGCCGCCGTGGCTATGNTCGTGTCCGATTTCCGCA 
AAGAGTTCTACGAGGTGGTCCAGAGCCAGAGGGTCCTTCTCTTCGTGGCCTCGGANGTGGAT 
GCTCTGTGTG03TGCAAGATCCTTCAGGCCTTGTTCCAGTGTGACCANGTGCAATATANGCT 
GGTTCO^GTTTCTGGGTGGCAAGAACTTGAAACTGCATTTCTTGAGCATAAAGAACAGTTTC 
ATTATTTTATTCTCATAAACTGTGGAGCTAATGTAGACCTATTGGATATTCTTCAACCTGAT 
GAAGACACTATATTCTTTGTGTGTGACACCC^TAGGCCAGTCAATGTTGTCAATGTATACAA 

CGATACCC 



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CAGGAACCCTCTCTTTGGGTCTGGATTGGGACCCCTTTCC^ 
CACGAAGGGACGATACCAGAAAACACCCTCAACCCAAAGQAAATA^ 

GCTGACTTTGGCTATAGAAAAAAGAAAGGAACGAAAAGAGACAGTTTTTTTTGGAAAGCT^ 

GTCTTCCCTTTATCGAGTCAAGAAACCCCCCCT^ 

GAGTAAAGTACGCTCCGGTCACCATGGTGACAGC 

CTGCTCTTTCTCCTGATGTGTGAGATCCGTATGGTC 

CAGCGGCTGCCAACGGTGCTGTGACTCTGAGGACCCCCTGGATC 

CCTCTTCCTCCGGCCGCCCCCACGCCCTGCCTGAGATCAGACCCTACATTAATATCACCATC 

CTGAAGGGTGACAAAGGGGACCCAGGCCCAATGGGCCT 

TCCCCAAGGGGAGCCTGGCCCTCAGGGCAGC^ 

GCGCCCCGTGCCAGAAGCGCTTCrTCGCCTTCTCAG 

GGCGAGGACTTCCAGACGCTGCTCTTCGAAAGGGTCTTTGTGAACCT 

CATGGCGACCGGCGAGTTTGCTGCTCCCCTGCX5TGGCATCTACT^ 

ACAGCTGGAATTACAAGGAGACGTACGTCCACATTATC 

CTGTACGCGCAGCCCAGCGAGCGCAGCATCATGCAGAGCCAG 

CTACGGGGACCGCGTCTGGGTGCGGCTCTTCAAGCGCCAGCGCGAGAACGCCATCTACAGC^ 

ACGACTTCGACACCTACATCACCTTCAGCGGCGACCTCATCAAGGCCGAGGACGACTO 

CCTCTGGGCCACCCTCCCGGCTGGAGAGCTC^GGTGCTGGTCCCGTCCCCTGCAGGGCTCAG 

TTTGCACTGCTGTGAAGCAGGAAGGCCAGGGAGGTCCCCGGGGACCTC 

CCCrGCTTCTATCTTGGCTGCCATCATCCCTCCCAGCCTATTTCTGCTC 

TGGACCTATTTTAAGAAGCTTGCTAACCTAAATATTCTAGAACTTTCCCAGCCTCGTAGCCC 

AGCACTTCTCAAACTTGGAAATGCATGCGAATCACCC^ 

GACTCAGCAGGTCTGAGTGGGTCCAGGATTCTGTGTTTCTCATATGTTCCTGGGTC 

ATGGGGTCAGTCTATGAACCACACTGGAGCAACCAGGTTCTAGGACTTTCT 

TACTTTCTGAACATTCTGGAATCCTCCCCACATTCT^ 

TGAGACAGAGTCTTGCTCTGTTGCCCAGGCTAGAGTGCAGTGGTGCAATCTCAGTTC^ 
AACCTCTGCCTCCCGGGTTCAAGCGATTCTTCTGCCTCAGCCTCCCTAGTGGCTGGGATTA^ 
AGGCGCCTGCTACCATGCCTGGCTAATTTT^^ 
TTGGCGAGGCTGGTCTTGAACTCCTGACTTCAGGT^ 
GCTGGGATTACAGGTGTGAGCGACCGTGCCTGGCCA 
CCCTCCAGGGCTCCCCGTGCTATGTTCTCT 
-TG<^CCAeTGeAGeeAeeGTT<^T~T-TATTGAT-TGA 
GGGTCCCGGGAAGGGTGAGGGGGTCAGACACAGGCCCTGCCCCTGCCCTCAGTGA 
GTCCAGCCCAGGCGGGGAGAGATGTGTACATAGGTTTTAAAGCAGACCCAGAGCTCATGGGG 
GCCTGTGTTCTGGGTGTTCAGGTGCTGCTGK3TCCT 

TGGGACGGGGTCCCGGTGGCAGGGGCAGGTATCTCCTTCCCGTTCCTCATCCACC 
TGCTCATCGTTACAGCAAACCCCAGGGGGCCTTGGCCAGGTCAAGGGTTCT^TGAGGAGA^ 
ACCCAGGAGTGTGGGGGCATTTGGGGGGTGAAGTGGCCC^ 
TAGCTCTCCCCACAGCTGATACGGCATCCTGCGAG 

CTTCCTGCCTCCTCCCAGGGCTCTCCCAGGGCCTTGCTCAGTCCCTTCCA 
GAACTTCCGTTTCCCC&GGGCCTC^ 

CTCTGCCCCTCATGCCCCTCTCACCGGCCCAGTGCCCCGACTCTCCAGGCTTTATCAAGGTG 

CTAAGGCCCGGGTGGGCAGCTCCTCGTCTCAGAGCCCTCCTCCGGCCTGGTGCTGCCTTTAC 

AAACACCTGCAGGAGAAGGGCGACGGAAGCCCCAGGCTTTAG^ 

AGCTAGAGCAAAGGAGGGACCTCAGGCCTTCCGTTTC 

GTTCCCCTAGCCTTCCAAACCCAGGTGGCCTGCCCT^ 

CCATTGGTGCTCATGCAGACTCTGGGGCTGAGGTGC^ 

AGCCGAGGGAGCCGTGGCTCCATGGCCAGATGACGGAAACAG 

AGACCTGTGCTATAAACC^CCCTGCCTGATC^^ 

GTCCAGCATGATTAAAGAATGCTGTCTCCTCTTGGAAAAAAAAAA 
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MVTAALGPWAALLLFLLMCEIRMVELTFDRAVASGCQRCCDSEDPLDP 
ALPEIRPYINITILKGDKGDPGPMGLPGYMGREGPQGEPGPQGSKGDKGEMGSPGAPCQKRF 

FAFSVGRKTALHSGEDFQTLLFERVFVN^ 
YVHIMHNQKEAVILYAQPSERSIMQSQSVMLDIAYGDRVWVRIJ 

FSGHLI KAEDD 

Important features:

Signal peptide:
amino acids 1-20

N-glycosylation site:
amino acids 72-75

C1q domain proteins:
amino acids 144-178, 78-111 and 84-117
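
The N-glycosylation site annotated above at amino acids 72-75 spans four residues, which matches the usual four-residue sequon pattern (N, any residue except P, S or T, any residue except P). The following is a minimal sketch of how such a site could be located, not the method used to prepare the figure. The function name `find_sequons` and the `offset` parameter are illustrative. The value `offset=62` is an assumption: it supposes that the preceding, partly illegible line of the figure held 62 residues, like the other full lines.

```python
import re

# Four-residue N-glycosylation sequon: N, not P, S or T, not P.
# The lookahead lets overlapping candidates be reported as well.
SEQUON = re.compile(r"(?=(N[^P][ST][^P]))")


def find_sequons(seq, offset=0):
    """Return (start, end, motif) tuples, 1-based and inclusive.

    offset is added to both coordinates so that a fragment can be
    reported in full-length numbering (hypothetical helper).
    """
    return [
        (m.start() + 1 + offset, m.start() + 4 + offset, m.group(1))
        for m in SEQUON.finditer(seq)
    ]


# Second line of the protein sequence as printed in the figure.
fragment = "ALPEIRPYINITILKGDKGDPGPMGLPGYMGREGPQGEPGPQGSKGDKGEMGSPGAPCQKRF"

# Assumption: 62 residues precede this fragment in the full-length protein.
hits = find_sequons(fragment, offset=62)
print(hits)
```

Under that assumed offset the single match, NITI, falls at residues 72-75, which agrees with the annotated site. With a different offset the coordinates shift accordingly.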



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ACTCGAACGCAGTTGCTTCGGGACCCAGGACC^ 
AGGCCGOMCCTGCCCCGCCCGGCT^^ 
CTCCAGGGTCCCTCTGCTGCTGCrc 
GCTGCCCATCCGGCTGCCAGTGCAGCCAGCC^ 

ACCACGGTGCCCCGAGACGTGCCACCCGACACGGTGGGGCTGTACGTCTTTGAGAACGGCAT 
CACCATGCTCGACGCAGGCAGCTTTGCCGGCCTGCCGGGCCTGCAGCTC^ 

AG&ACCAGATCGCCAGCCTGCC 

GACCTGACGGCCAACAGK3CTGCATGAAATC 

CGAGCGCCTCTACCTGGGC^GAACCGC^ 

TCGACCGCCTCCTGGAGCTC^GCTGCAGGAC^^ 

CTGCCCCGCCTGCTGCTGCTGGACCT^ 

CCTGGACACTGCCAACGTGGAGGCGCTGCGGCTGGCTOGTCTGGGGCTGC^ 

AGGGGCTCTTCAGCCGCTTGCGCAACCTCCACG^ 

CGAGTGCCACCTOTGATCCGAGGCCTCC^ 

CCGCATTGCCCAGCTGa^CCCGAGGACCTC^ 

TGAGCAACCTAAGCCTGGAGGCCCTGTCCTGGCGA 

CTGCTGGCAGCTGCCCGCAACCCCTTCAACTGOT 

GGTGCGCGAGAGCCACGTCACACTGK3CCAGCCCTGAGGA 

AGAACGCTGGCCGGCTGCTCCTGGAGCTTGACTATC 

ACCACAGCCACAGTGCCCACCAC<3AGGCCCGTGGTG<X5GGAC^ 

CTTGGCTCCTACCTGGCTTAGCCCCACAGC^ 

CTGCCCCACCGACTGTAGGGCCTGTC^ 

AATGGGGGCACATGCCACCTGGGGACACGGCACCACCTGGCGTGCT 
CACGGGCCTGTACTGTGAGAGCCAGATGGGGCAGG^^ 

CXjCCGAGGCCACCACGGTCCCTGACCCTGGGCATCG^CCGGTGAGCC^CACCTCCCTGCGC 

GTGGGGCTGCAGCGCTACCTCCAGGGGAGCT^ 
TCGCAACCTATCGGGCCCTGATAAGOMCTGGTC 
AGTACACGGTCACCCAGCTGCGGCCCAACGCCACTTACTCCGTCTC 
CCCGGGCGGGTGCCX^AGGGCGAGGAGGCCTGCGGGGAGGCCXIATACACCCC 
CTCCAACCACGCCCCAGTCACCCAGGCCC^^ 
* CCCTGGCCGCGGTGCTCCTGGCCX3CGCTGGCT 
GGGCGGGCCATGGCAGCAGCGGCTCAGGACAAAGGGCAGGTGGGGC 
GXIAACTGGAGGGAGTGAAGGTCCCCTTGGAGCCAGGCC^ 
AGGCCCTGCCCAGCGK^TCTGAGTGTGAGGTGC^ 
CAGTCACCCCTCCACGCAAAC^ 
GGCTCTCAGCCAGTGAGATGGCCAGCCCCCTCCT 

CAACCTCGGGGATGTGTGCAGACAGGGCTGTGTGACCACAGCTGGGCCCTGTTCCC 

CCTCXJGTCTCCTCATCTGTGAGATGCTC 

CGAGTGCCTATGAGGACAGTGTCCGCCCTGCCCTCTO 

GGCCCTGCC^TGTGCTGGTAACGCATGCCTGGGTCCTGCTGGGCT 

CCCTGGGGGCCAGTGAAGGAAGCTCCCGG 

TGACTCTAGTCTTGGC<TCCA<^AAGCGAA 

GGAACATGTTTTGCTTTTTTAA 

GG AAGATGTTTTTCAAACTCAGAGACAAGGACTTTGGTTTTTGT 
AAGGCCTTTTGTAAGAAAAAATAAAAGATGAAGTGTGAAA 
FIGURE 32

MCSRVPLLLPI^l^PGVQGCPSGCQCSQPQT^CTARQGT^^ 

GIT^DAGSFAGLPGIX3IJ^LSQNQIASLPSGVFQPLANLSNLDLTANRLHEITNETFRGIjR 

R^RLYI^KNRIRHIQPGAFDTI^RLI^LKLQDNELRALPPLRLPRLLIiDLSHNSI^EP 

GILDTANVEALRIAGLGLQQI^EGLFSRLP^HDLDVSDNQLERVPPVIRGI^GLTRI^ 

NTRIAQLRPEDLAGLAALQELDVSiniSLQALPGDLSGLFPRLRIjLAAARNPTO 

PWVRESHVTTIASPEETRCHFPPKNAGRIjLLELDYADFGCPATTTTATVPTTRPVVRE 

SSIAP^SPTAPATEAPSPPSTAPPTVGPVPQPQDCPPSTCI^CMTCHWTRH^ 

gftglycesqmgqgtrpsptpvtprpprslt^iepvsptsi^vgwrywssvqi^slrl 
tyrnlsgpdki^vtlrlpasi^ytvtqi^pnatysvcvmplgpgrvpegeeacgeahtppa 

vhsnhapvtqaregnlplliapaij^vli^^ 
pi^legvkvpi^pgpkateggge^psgsecevplmgfpgpglqsplhakpyi 
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GAATCATCCACGCACCTGCAGCTCTGCT 

CITCATCATTCATATGAGGAAATAAGTGGTAAAATCCTT 

AAACATTTACATATTTTGTAGTATTGTTAT^^ 

AAGAAAGGGAACTGATGACCAACTGCTCXIAACATGTCTCTAAGAAAG 

ACCCCAGCCACAACGACACTGGATTTATCCTATAACCTCCTTTTTCAACT 

TITTCATTCTGTCTCCAAACTGAGAGTTTTGATTCTATGC<^ 

ATCTCAAAACCTTTGAATTCAACAAGGAGTTAAGATATTTAGATTTG 

AAGAGTGTAACTTGGTATTTACTGGCAGGTCTCAG 

TGACACCATGCCTATCTGTGAGGAAGCTGGCAACATGTCACACCTGGAAATCCT 

GTGGGGCAAAAATACAAAAATCAGATTTCCAGAAAATTGCTCATCT 

TTCTTAGGATTCAGAACTCTTCCTCATTAT^ 

AAAACTGCACATTGTTTTACCAATGGACACAAATTTCTGGGTTCTTTTGCGTG 

AGACTTCAAAAATATTAGAAATGACAAATATAGATGGCAAAAGCCAATTTC 

ATGCAACGAAATCTTAGTTTAGAAAATGCTAAGACATCGGTTCTATTGCT 

TTTACTCTGGGACGACCTTTTCCTTATCTTACAATTTGT 

TTCAGATCCGAAATGTGACTTTTGGTGGTA 

TCAAATACTGTAATGAGAACTATAAAATTGGAGCATGTACATTTCAGAGTGTTTT 

ACAGGATAAAATCTATTTGCTTTTGACCAAAATGGACATA 

CACAAATGCCACACATGCTTTTCCCGAATTATCCTACG 

AATAATATCTTAACAGACGAGTTGTTTAAAAGAACTATCCAACTGCCTCA 

CATTTTGAATGGCAATAAACTGGAGACACTT^ 

CCTTGGAAGACTTGGATCTGAGTCAAAATCTATTACAACA 

TGGCCAGAAACTGTGGTCAATATGAATCTGTCATACAATA 

GTGCTTGCCCAAAAGTATTCAAATACTTGACCTAAATAATAAC^ 

AAGAGACTATTCATCTGATGGCCTTACGAGAACTAA^ 

CTCCCTGGATGCAGTCATTTCAGTAGACTTTCAGTTCT 

CAGCCCATCTCTGGATTTTGTTCAGAGCTGC^ 

ATCCATTCCGGTGTACCTGTGAATTAAAAAATTTCATTCAGCTTGAAACATATTCAGAGGTC 
ATGATGGTTGGATGGTCAGATTCATACACCTGTGAATACCCTTTAAACCTAAGGGGAACTA^ 
GTTAAAAGACX3TTCATCTCCACGAATTATCTTGCAACACAGCTCTGTTGATTGTCAC 
TGGTTATTATGCTAGTTCTGGGGTTGGCTGTGGCC 

TGGTAT CTCAGGATGCTAGGTCAATGCACACAAA CATGGCACAGGGTTAGGAAAAC^ 
AGAAGAACTCJVAGAGAAATGTCCGATTCCAGGCATTTATTTC 

TGTGGGTGAAGAATGAATTGATCCCCAATCTAGAGAAGGAAGATGGTTCTATCTTGATTTGC 
CTTTATGAAAGCTACTTTGACCCTGGCAAAAGCATTAG 
GAAAAGCTATAAGTCCATCTTTGTTTTGTCTCCCAACTTTGTCCAGAAT^ 
ATGAATTCTACTTTGCCCACCACT^TCTCTTCCATG 

TTACTGGAACCCATTCCATTCTATTGCATTCCCACCAGGTATCATAAACTGA 
GGAAAAAAAAGCATACTTGGAATGGCCCAAGGATAGGCGT^ 

ACCTTCGAGCTGCTATTAATGTTAATGTATTAGCCACCAGAGAAATGTATGAACTGCAGACA 

TTCACAGAGTTAAATGAAGAGTCTCGAGGTTCTACAATC^ 

ATAAAATCCCACAGTCCTTGGGAAGTTGGGGACCACATACACTCTTGG^ 

CAACCTTTATGATGGCAATTTGACAATATTTATTAAAATAAAAAATGGTTATTCCCTTCATA 

TCAGTTTCTAGAAGGATTTCTAAGAATGTATCCTATAGAAACACCTTCACAAGTTTATAAGG 

GCTTATGGAAAAAGGTGTTCATCCXIAGGATTGTTTATAATCATGAAAAATGTGGCCAGGTGC 

AGTGGCTCACTCTTGTAATCCCAGCACTATGGGAGGCCAAGGTC 

GAGATGGAGACCATCCTGGCCAACATGGTGAAACCCTGTCTCTACTAAAAATA<^ 

GCTGGGCGTGATGGTGCAGGCCTGTAGTCCCAGCTACTTGGGAGGCTGAGGC^GGAGAAT<^3 

CTTGAACCCGGGAGGTGGCAGTTGCAGTGAGCTGAGATCGAGCCACT^ 

GACAGAGCGAGACTCCATCTCAAAAAAAAGAAAAAAAAAAAAGAAAAAAATO 

TCATGGCCACAAAATAAGGTCTAATTCAATAAATTATAGTACATTAATGTAATATAATATTA 

CATGCCACTAAAAAGAATAAGGTAGCTGTATATTTCCTGGTATGGAAAAAACATATO 

GTTATAAACTATTAGGTTGGTGCAAAACTAATTGTGGTTTTT<5CCATTG^^ 

ATAAAAGTGTAAAGAAATCTATACCAGATGTAGTAACAGT<^ 

TTACAGGGAGCATTTGATTTCTATCTTGTGTATTTOTATAATGTTTC 

ATCTGTATTTCTTTTATAAGTAGAAAAAAAATAAAGATAGTTTTTACAGCCT 
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MRLIRNIYIFCSIVMTAEGDAPELPEERELMTNCSNMSLRKVPADLTPATTTLDLSYNLLFQ 
LQSSDFHSVSKLRVLILOMRIQQLDLKTFEFNKELRY^ 

SFNDFmMPICEEAGNMSHLEILGLSGAKIQKSDFQKIAHLHLNTVFLGFRTLPHYEEGSLP 
ILNTTKLHIVLPMDTNFWVLLRIXSIKTSKILEMTNIDGKSQPVSYEMQro 
IJJKVDLLWDDLFLI LQFVWHTS VEHFQ I RNVTFGGKAYLDHNS FDYSNTVMRT I KLEHVHFR 
VFYIMDKIYIiLTKMDIENLTISNAQMPHMLFPNYPTKFQYlOTANNILTDELPKRTIQLP 

HLKTLILNGNKLETLSLVSCFANNTPI^HLDLSQ^^ 

DSVFRCLPKSIQILDLNNNQIQTVPKETIHLMALRELNIAFNFLTDLPGCSHFSRLSVLNIE 
MNF I LSPS LDFVQS CQEVKTLNAGRNPFRCTCELKNF I QLETYSEVMMVGWS DSYTCE YPLN 
LRGTRLKDVHLHELSCNTALLI VT I WI MLVLGLAVAF CCLHFDLPWYLRMLGQCTQTWHRV 
RKTTQEQLKRNVRFHAFISYSEHDSLWVKNELIPNLEKEDGSILICLYESYFDPGKSISENI 
VSFIEKSYKSIFVLSPNFVQNEWCHYEFYFAHHNLFHENSDHIILILLEPIPFYCIPTRYHK 
LKALLEKKAYLEWPKDRRKCGLFWANLRAAINVNVLATREMYELQTFTELNEESRGSTIS^ 



RTDCL 
FIGURE 35A

gggggctttcttgggcttg^ 

cgccgggaaagggagggaagaaggaagggcggggccggcccc^ 

ctgcgcgcccctgtccgccccggc^ 

gcc^gccggccgcctcccgcgcccaagc^^ 

cxsccagcttctxtcgcctocagccc^ 

cggggcggagcaggcatgtcccgcccggggaccgctac^ 

ggcagtgaccctggccggggtcggagcccagggcgcagccct 

ggc^ggagatctggagccgggagccctactac^^ 

tctccgccgctgcctgcggggcccggggaggagtgggagcggcgcccgcaggagccca<x5c^ 

gcccaagagggcgaccaagcccaagaaagctcccaagagggag^ 

ctccaccaggtaaacacagcaacaaaaaagtt^ 

aacgatgatc^cagtgtccgtctggcccgtgaagatgtcagagagagtttc 

tctggaaaccttaaaaatcacagacttccagctccatc 

tgggggcacatcgagggagactcaacatcc^ 

ggagcgtggtgcgcgggaagaaatgacctccagcagtggattgaagtcgatc 

gaccagattcactggtgtcatcactcaagggaggaactcccrot 

catcctataaggtcatggtgagcaatgacag^ 

ggagacatgatatttgagggaaacagtgagaaggagatccctgttctcaatga 
ccccatggtgk^ccgctacatccgcataaa 
gcatgagaatggagatcctgggctgcccactc 
aacgagatgaccaccactgatgacctggattttaagcacc^ 
gttgatgaaagttgtgaatgaaatgtgtcccaatatca 
gccaccagggcctgaagctgtatgctgtggagatctcagatcaccctggggagc^ 
ggtgagcccgagttcc^ctacatcgcgggggcccacggcaatgaggtgcr 
gctgctgctgctggtgcagttcgtgtgtcaggagtacttggcccgg7vatgcgcgcat03tcc 
Tacctggtgga^ 

gcctacgaagggggctcggagctgggaggctggtccct 

tgacatcaacaacaactttcctgatttaaac^ 

atgtccccaggaaagttcccaatc^ctatattgc 

gccacggtggctgccgagaccagagcagtcatagcctggatggaaaaaatccct 

gggcggcaacctgcagggcggcgagctggtggtggcgtatccct 

cctggaagacgcaggaacacacccccacccc 

tcctatgcctccacacaccgccrcatgacagacgccot 

cttccagaaggaggagggcactgtcaatgg^ 

acgatttcagctaccttcatacaaactgcttcgaactgtccatctacg^tc 

tacccacatgagagccagctgcccgagga 

catggagcaggttgatcgtggcattaaaggct 

caaacgccattatctccgtagaaggcattaacc^^ 

tactggcgcctcctgaaccctggagagtatgtggt 

atccacg^gaactgtatggttggctatgac^ 

gcaaaaccaacatggccaggatccgagagatcatggaga^ 

ctgccagccaggcggctgaagctgcgggggcggaagagacgacagggtggg 

ggcccttgagactcgtctgggacccatgcaaatt^ 

gactcactcactgttgtttcctctgtaattcaagaagtgcctc 

AGGCAGGTCCCAAAAGGGAAGGCTGGAGGCTGAGGCTGTTTT 

TCCAAATAACTTGGACAGAGCAGCAGAGAAAAGCTGATGGGAGTGAGAGAACTCAGCAAGCC 
AACCTGGGAATCAGAGAGAGAAGGAGAAGGAGGGGAGCCTGTC^ 
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ATAGAAAAGGATTCTGGTGCTTCCCCTGTTTGCGTGGCAGCAAGGGTTCCACGTGC^TTTGC 
AATTTGCACAGCTAAAATTGC^GCATTTCCCCAGCTGGGCTGTCCCAAATGTTACCATTTGA 
GATGCTCCCAGGCGTCCTAAGAGAATCCACCCTCTCTGGCCCTGGGACATTGCAAGCTGCTA 
CAAATAAATTCTGTGTTCTTTTGACAATAGCGTCATTGCCAAGTGCACATCAGTGAGCCTCT 
TGAATCTGTTTAGTCTCCTTTTTCAACAAAGGAGTGTGTTCAGAAAAGGAGAGAGAGGCTGA 
GATCATTCAGGAGTTTGTTGGGCAGCAAGCATGGAGCTTCTTGCACAAATTCTGGGTCQ^TA 
AACAACCCCCAAAGTCCCTGCTGATCCAGTAGCCCTGGAGGTTCCCCAGGTAGGGAGAGCCA 
GAGGTGCCAGCCTTCCTGAAGGGCCAGAAAATTTAGCCTGGATCTCCTCTTTTACCTGCTAG 
GACTGGAAAGAGCCAGAAGTGGGGTGGCCTGAAGCCCTCTCTCTGCTTGAGGTATTGCCCCT 
GTGTGGAATTGAGTGCTCATGGGTTGGCCTCATATCAGCCTGGGAGTTATTTTTGATATGTA 
GAATGCCAGATCTTCCAGATTAGGCTAAATGTAATGAAAACCTCTTAGGATTATCTGTGGAG 
CATCAGTTTGGGAAGAATTATTGAATTATCTTGCAAGAAAAAAGTATGTCTCACTTTTTGTT 
AATGTTGCTGCCTCATTGACCTGGGAAAAATGAAAAAAAAAAATAAAGCAAATGGTAAGACC 
CTTAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAA 
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FIGURE 36 



MSRPGTATPALALVLLAVTIAG^ 

AGPGEEWERRPQEPRPPKRATKPKKAPKREKSAPEPPPPGKHSN^ 
VRVAREDVRESCPPLGLETLKITDPQLHASTVKRYGLGA 

GRNDLQQWIEVDARRLTRFTGVI TQGRNSLWLSDWVTS YKVMVSNDSHTWVTVKNGSGDMI F 
EGNSEKE I PVLNELPVPMVARY IR INPQSWFDNGS I CMRME I LGCPLPDPNNYYHRRNEMTT 
TDDLDFKHHNYKEI^QLMKVVNEMCPNITRIYNIGKSHQGLKLYA 
HYIAGAHGlSngVLGRELLLLLVQ 

SELGGWSLGRWTHDGIDINNNFPDLNIIjLWEAEDRQNVPRKVPNHYIA 

ETRAVIAWMEKIPFVLGGNLQGGELWAYPYDLVRSPWKTQEHTPTPDDHVFRWLAYSYAST 

HRLMTDARRRVOTTEDFQKEEGTVNGASWHTVAGSLNDFSYLHTNCFELSI^^ 

QLPEEWENNRESLIVFMEQVHRGIKGLVRDSHGKGIPNAII^ 

NPGEYVWAKAEGFTASTKNCMVGYDMGATRCDFT^ 

LKLRGRKRRQRG 
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FIGURE 37 

CTAAGAGGACAAGA^AGGCCCGGCCTCTCATTTCTGCTAGCCCTTCT 

AAGCTGCAGGGGATTTGGGGGATGTGGGA 

CCAGfGTGTTGACTCCAGCTCCAGCTO 

CAGCITAGGCAGCGGAGGTTCTGTGTCCCAGTTGTTT^ 

ACCGTGGGACCTGCCAGTGCTCTGTTTCCCTGCCAGA^ 

GAACGCTTGGAATTCACAGCTCATGTTCTTTC 

GAGGGAATATGTCCAATTAATTAGTGTGTATGAAAAGAAACTGTTAAACCTAACTGTCCGAA 
TTGACATCATGGAGAAGGATACCATTTC 

GAAGTGAAGGAGATGGAAAAACTGGTCATACAGCTGAAGGAGAGTTTTGGTC 
AATTGTTGACCAGCTGGAGGTGGAGATAAGAAATATGACT 

CACTAGACAAAAACAATGTCCTTGCCATTCGCCGAGAAATCGTGGCTCTGAAGACCAAGCT 

AAAGAGTGTGAGGCCTCTAAAGATCAAAACACCCCTGTCGTCCACCCTCCTCCCACTCCAGG 

GAGCTGTGGTCATGGTGGTGTGGTGAACATCAGCAAACCGTCTGTGGTTCAGCrrCAACTGGA 

GAGGGTTTTCTTATCTATATGGTGCTTGGGGTAGGGATTACTCTCCCCAGC^ 

GGACTGTATTGGGTGGCGCCATTGAATACAGATGGGAGACTGTTGGAGTATTATAGACTGTA 

CAACACACTGGATGATTTCCTATTGTAT 
AAGGTAGTGGTACAGCAGTTTAGAACAACAACAT^ 
ATTGCCAGAGTTAACCTGACCACCAAC^ 
CTATAATAACCGCTTTTCATATGCTAATGTTGCT 

AGAATGGATTGTGGGTTATTTATTCAACTGAAGC C AGC ACTGGTAACATGGTGATTAGTAAA 

CTCAATGACACCACACTTCAGGTGCTAAAGACTTGGTATACCAAGCAGTATA 

TTCTAACGCCTTCATGGTATGTGGGGTT^ 

AAGAGATTTTTTACTATTATGACACAAACACAGGGAAAGAGGGGAAACTAGA 

CATAAGATGCAGGAAAAAGTGCAGAGCATTAACTATAACCCTTTTGACCAGAAACTTTATGT 

CTATAACGATGGTTACCTTCTGAATTATGATCTTTCTGTCTTGCAGAAGCCCCAGT^GCTG 

TTTAGGAGTTAGGGTGAAAGAGAAAATGTTTGTTGAAAAAATAGTCITCTCC^CT 

ATATCTGCAGGGGTGTCTAAAAGTGTGTTCATTTTGCAGCAATGTTTAGGTGC^TAGTTCTA 

CCACACTAGAGATCTAGGACATTTGTCTTGATTTGGTGAGTTCTCTTGGGAATCATCTGCCT 

CTTCAGGCGCATTTTGCAATAAAGTCTGTCTAGGGTGGGATTGTCAGAGGTCTAGGGGCACT 

GTGGGCCTAGTGAAGCCTACTGTGAGGAGGCTTCACTAGAAGCCTTAAATTAGGAATTAA^ 

AACITAAAACTCAGTATGGCGTCTAGGGATO^ 

TCCTCATCCATGTAGCACCACTAATTCT 

CACCTGGAATGATGCTTTGTATGTGGCAGATAAGTAAATTTGGCATGCTTATATATTCT 
TCTGTAAAGTGCTGAGTTTTATGGAGAGAGGCCTT^ 

TAAATCCCAGAAGGATCTGTAGATGAGGCACCTGCTTTTTCTTTTCTCTCATTGTCCACCTT 

ACTAAAAGTCAGTAGAATCTTCTACCTCATAACTTCCTTCC^^GGCAGCTCAGAAGATTAG 

AACCAGACTTACTAACC^TTCCACCCCCCACCAACCCCCTTCrACTGCCTA 

ATTAATAGTTTTCTATGGAACTGATCTAAGATTAGAAAAATTAAT^ 

TGGACTTTTATTTACATGACTCTAAGACTAT^ 

GCATTTATTGTTATCTAATAAAGACCTTGGAGCATATC 

TTGCATGTAATTTTTGCCTTTGTTTAAGCCT 

TTTTTCTAGGACGAGCTATAGAAAAGCTATTGAGAGTATCT 

GAAACCTTGCTGGTGTATGTGATGTGCTTCTGTGCTTTTGAATGACTTTATC^ 

TGTCTATTTTTCCTTTGATGTTCAAGTCCT^ 

TCCCCCTTTTAAAATAAATGATTAAAATGTGCTCT 
FIGURE 38

MRPGLSFLIJ^FFLGQAAGDLGDVGPPIPSPGFSSFPGVDSSSSPSSSSRSGSSSSRSLGS 
GGSVSQLFSNFTGSVDDRGTCQCSVSLPDTTFPVDRVERLEFTAHVLSQKFEKELSKVREYV 
QLISVYEKKLLNLTVRIDIMEKDTISYTELDFELIKVEVKEMEKLVIQLKESFGGSSEIVDQ 
LEVE I RNMTLLVEKLETLDKNNVLAI RRE I VALKTKLKECEAS KDQNTP WHP PPTPGSCGH 
«^raiSKPSWQLNVTOGFSYLYGAWGRDYSPQHPNKGLYWAP 

DLLL YI NARELR I TYGQGSGTAVYNNNMYVNMYNTGNI ARVNLTTNTIAVTQTLPNAAYNNR 
FSYANVAWQDIDFAVDENGLWIYSTEASTGNMVISICLNDTTLQVLNTWYTKQYKPSA 

MVCGVLYATRTMNTRTEEI F YYYDTNTGKEGKLDI VMHKMQEKVQS INYNPFDQKLYVYNDG 
YLLNYDLSVLQKPQ 
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GCTCTGAAGACCAAGCTGAAAGAGTGT6AGGCCTCTAAAGATCAAACACCCCTGTCGTCCAC 
CCTCCTCCCACTCCAGGGAGCTGTGGTCATGGTGGTGTGGTGAACATCAGCAAACCGTCTGT 
GGTTC^GCTCAACTGGAGAGGGTTTTCrrTATCTATATGGTGCTTGGGGTAGGGATTACTCTC 
CCCAGCATCCAAACAAAGGNATGTATTGGGNGGCGCCATTGAATACAGATGGGAGACTGTTG 
GAGTATTATAGACTGTACAACCCACTGGATGATTTGCTATTGTATATAAATGCTOGAGAGTT 
GCGGATCACCTATGGCCAAGGTAGTGGTACAGCAGTTTACy^a\ACAACATGTACGTCAACA 

TGTACAACACCGGGNATATTGCCAGAGTTAACCTGACC 
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TCTCGCAGATAGTAAATAATCTCGGAAAGGCGAGA 
CCGCTGCTCTTGTGACGTTGTGGAGATGGGGAG 

GGATACCATGTTTGTGTGGAAGTGCCCCGTGTTTGCTATGCCGATGCTGTCCTAGTGGAAAC 

AACTCCACTGTAACTAGATTGATCTATGCACTTTTCTTGCTTGTTGGAGTATGTGTAGCTTC 

TGTAATGTTGATACCAGGAATGGAAGAACAACTGAATAAGATTCCTGGATTTTGTGAGAATC 

AGAAAGGTGTTGTCCCTTGTAACATTTTGGTTGGCTATAAAGCTGTATATCGTTTO 

GGTTTGGCTATGTTCTATCTTCTTCTCTCTT^ 

TCCTAGAGCTGC^GTGCACAATGGATTTTGGTTCTTTAAATTTGCTG 

TTATTGGGGCATTCTTCATTCCAGAAGGAACTTTTACAACTGTGTGGT^ 

GCAGGTGCCTTTTGTTTCATCCTCATACAACTAGTCTT^ 

GAATGAATCGTGGGTTGAAAAAATGGAAGAAGGGAACTCGAGATGT^ 

TATCAGCTACAGCTCTGAATTATCTGCTGTCTTTAGTTGCTATCGTCCT 

TACACTCATCCAGCCAGTTGTTCAGAAAACAAGGCGTTCATCAGTGTCAA 

CGTTGGTGCTTCTGTAATGTCTATACTGCCAAAAATCCAAG 

TGTTACAGTCTTCAGTAATTACAGTCTACACAATGTATTTGACATGGTCAGCT 

GAACCAGAAACAAATTGCAACCCAAGTCTACTAAGCATAATTC 

TGTCCCAAAGGAAGGGCAGTCAGTCCAGTGGTGGCATGCTCAAGGA 

TCTTTTTGTTGTGTGTATTTTATTCCAGCATCCGTACTTCAAACAATAGTCAGGT^ 

CTGACTCTAACAAGTGATGAATCTACATTAATAGAAGATGGTGGAGCTAGAAGTGATGGATC 

ACTGGAG GATGGGG ACGATGTTCACCGAGCTGTAGATAATGAAAGGGATGGTGTCACTTACA 

GTTATTCCTTCTTTCACTTCATGCTTTTCCTGGCTTCAOT 

AACTGGTCCAGGTATGAACCCTCTCGTGAGATGAA 

AATCTCITCCAGTTGGATTGGC^TCG 

TTACAAATCGTGATTTTGACT^GTGAGACTTCT^ 

TTATTTGAAAACAGTATTCCCAACTTTTGTAAAGTTC 

TTCTCCAGTGTTCTGGCATGAATTAGATTTTACTGCTTGTCATTTTGTTA 

GTGCATTGATATGTGAAGTAGAATGAATTGCAGAGGAAAGTTTTATGAATATGGTGATGAGT 

TAGTAAAAGTGGCCATTATTGGGCTTATTCTCTGCTCTATAGTTGTGAAATGAAGAGTAAAA 

ACAAATTTGTTTGACTATTOTAAAATTATATTAGACCTTAAGCTGTTTT 

GCAAATGTATGGCTGCCTTTTGAAATATTTGATGTGTTGCCTGGCAGGATACTGCAAAG 

AT GGTTT ATTTTAAAATTTA TAAA CAAGTCACTTAAATC 

AGGTTTTACCCnTG A TACXXaAATTTAGA C AGGTA G G 

TGGATGGAGGTGTCGGTACTAAATTGAATAACGAGT/^ 

CCTTTGCCAACAAAGTGAACTGTTTTGGTTGTT^ 

AATGTTTGGAACTCTGAAGGATTTAGACAAGGTTTTGAAAAGGATAATCATGGGTTAGAAGG 
AAGTGTTTTGAAAGTCACTTTGAAAGTTAGTTTTGGGCC 

AATCCCAGCACTTTGGGAGCTTAAGTGGGTAGATTACTTGAGCCCAGGAATTCAGACCAGCT 

TGGCACATGGTGAACCTGTTOTATAAAAATAATCTGGCTTTGAGCATATGCCTGTGG 

CACTGAGAGGCTAGTGAAGATTGCTGAGCCCAGAGCC^AAGGTTGCAGTGAGCAAG 

CACTGCACTCTAGCTGGCACAGAGTAAGCCAAAAAAATATAT^ 

CAAAATTTTGACAGGGAAGGAAGTAACTGC7UUVACCACTAGGCTTTAGTAGG 

AAATCTAGTCCAGTTCTCTCATTTAAAAAAATGAAGACACTGAAATAC^ACTTAAATAGCT 

CAGATAGCTAATTAGGAAATTTCAAGTTGGCCAATAATAGCATTCTCT 

TAATTTCTATTCAAAATACATGCATATTGATTTACACCTCATACTGTGATAATTAATC 

GTGGATTGCTGGTGTCCAGCATGACCCATAAACAGGTCAGAAGAATGATGGAATGTTTTAGA 

ATAAACTCCTGCTTATAGTATACTACAC^VGTTCAAAAGATC 

CTGCCATGTAATTGAAATATATAGATTATra 

AGAGTTTAGTTATTTGTATGTGTCACTAGTGTCTAATGAAGCTTTTAAAATCTACAATT^ 

tctttaaaaatatttattaatgtgaatggaatataacaattcagcttaattcccgaacctta 

TTCTGTGTGTAGACATTGTATTCCACAATTTTGAATGGCTGTGTTTTAGCTCTAAATAAATC 
AATTCAGAGAAAAAAAAAAAAAAA 
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MGSVLGLCSMASWIPCXCGSAPCLLCRCCPSG^ 

EQLNKIPGFCENEKGVVPCNILVGYKAVYRLCFGLAMFYLLLSLLMIKVKSSSDPRAAVHNG 
FWFFKFAAAIAIIIGAFFIPEGTFTTVWFYVGMAGAFCFILIQLVLLIDFAHSWNESWVEKM 
EEGNSRCWYAALLSATAI^LLSLVAIVLFFVYYTHPASCSENKAFISVNMLLCVGASVMSI 
LPKIQESQPRSGLLQSSVITVYTMYLTWSAMTNEPETNCNPSLLS I IGYNTTSTVPKEGQSV 
QWVraAQGIIGLILFIiLCVFYSSIRTSNNSQVNiCLTLTSDESTLIEDGGARSDGSLEDGDDVH 
RAVDI^RDGVTySYSFFHFMLFLASLYIMMTLTNWSRYEPSREMKSQVrTAVWVKISSSWIGI 



VLYVWTLVAPLVLTNRDFD 
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FIGURE 42 

GCGAGAAAGAAGCTGTCTCCATCTTGTCTGTATCCCGCTGCT 
GGGGAGCGTCCCTGGGGCTGTGCTCCATGra 

CCGTGTTTGCrATGCCGATGCTGTCCTAGTGGAAACAANTCCACTGTAACTAGA 

TGCACTTTTCTTGCTTGTTGGAGTATGTGTAGCTTGTGTAATC 

AACAACTGAATAAGATTCCTGGATTTTGTGAGAATG 

TTGGTTGGCTATAAAGCTGTAT^^ 

CTCTTTACTAATGATCAAAGTGAAGAGTAG 

TTTGGTTCTTTAAATTTGCTGCA^ 
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GTTATTGTGAACTTTGTGGAGATGGGAGGTCNTGGGGCTGTGTTCCATGGCGAGCTGGATAC 
CANGTTTGTGTGGAAGTGCCCCGTGTTTGNTATGCCGATGCTGTCCTAGTGGAAACAANTCC 

ACTGTAATTAGATTGATNTATGCACTTTTNTT 

GTTGATACCAGGAATGGAAGAACAACTGAATAAGATTCCTGGATTTTGTGAGAATGAGAAAG 
GTGTTGTCCCTTGTAACATTTTGGTTGGCTATAAAGCTGTATATNGTTTGTGCTTTGGTTTG 
GCTANGTTCTATNTTCTTCTCTCTTTACTAATGATC^AAGTGAAGAGTAGCAGTGATCCTAG 
AGCTGCAGTGCACAATGGATTTTGGTTTTTTAAATTTGCTGCAGCAATTGCAATTATTATTG 



GGGC 




FIGURE 44 

AAGAAGCTGTCTCCATCTTGTCTGTATCCGCT 

GTCCTT<K5GGTTGTGCTCCATGGCGAGCTGGATACCATGTTTC 

TGCTATGCCGATGCTGTCCTAGTGGAAACAACTCCACTGTAACT 

TTCTTGCTTCTTGGAGTATGTGTAGCTTGTGTAATGTTGATA 

GAATAAGATTCCTGGATTTTGTGAGAATGAGAAAGGTGTTGTCCCTTG 

GCTATAAAGCTGTATATCGTTTGTGCTTTG^ 

CTAATGATCAAAGTGAAGAGTAGCAGTGATCCTAGAGCT^ 

CTTTAAATTTGCTGCAGCAATTGCAATTATTATTGGGGC 
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FIGURE 45 

GCTGTCCTTAGTGGAAACAANTCCAACTTGTAACTTGGATTGATCTATGCACTTTTTCCTTG 

CTTGTTGGAGTATGTGTAGCTTTGTGTAATGTTGTTCCCAGGATTGGANGAACAACTGAATA 

AGATTCCTGGATTTTTGTGAGAATGAGAAAGGTGTTGTCCCCrTGTAACArrirL 

TATAAAGCTGTATATCGTTTGTGCTTTGGTTTGGCTATGTTCTATCTTOT 

AATGATCAAAGTGAAGAGTAGCAGTC^TCCTAGAGCTGCAGTGCACAATGGATTTT^ 

TTAAATTTGCTGCAGCAATTGCAATTATTATTGGGGCATT 

ACAACTGTGTGGTTTTATGTAGGCATGGCAGGTGCCTTTTGTTTCATCCTCATACAACTAGT 

CTTACTTATTGATTTTGCACATTCATGGAATGAATCGTGGGTTGAAAAAATGGAAGAAGGGA 

ACTCGAGATGTTGGTATGCAGCCrTGTTATCAGCTACAGCTCTGAATTATCTGCTGTOT 

GTTGCTATCGTCCTGTTCTTTGTCTACTA<»CTCATCCAGCCAGTTGTTCAGAAAACAAG^ 

GTTCATCAGTGTCAACATGCrCCTCrGCGTTGGTGCTTCTGTAATG 
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FIGURE 46A 

CTCGGGCGCGCACAGGCAGCTC^ 
GGCCTCTCCAATGGCAAATGTGTGTGGCTGGAGG 

CGAGTGTTTGCAGACCGGGGCGAGTCCTGTGAAAGCAGATAAAAGAAAACATTTATO 

GTCATTACGAGGGGAGCGCC03GCCGGGGCTGTCGCACTCCCCGCGGAACAT 

CCAGCTCCGAGAGAGGAGAAGAAGAAAGCGGAAAAGAGGCAGATTCACGTCGT^ 

AGTGGACCTGATCGATGGCCCTCCTGAATTTATCACGATATTTGATTTATTAGCGATGCCCC 

CTGGTTTGTGTGTTACGCACACACACGTGCACACAAGGCTCT^ 

TTCCAGCTCCTGGGCGAATCCCACATCTGTTTCAACTCTCCGC 

GAGTGTGTGGAATCTGCGAGTGAAGAGGGACGAGGGAAAAGAAACAAAGCCACA 

TTGAGACTCCCGCATCCCAAAAGAAGCACCAGATCAGCAAAAAAAGAAGATGGGCCCCC 

GCCTCGTGCTGTGCTTGCTGTCCGCAACTGTGTTCTCCCTGCTGGGTGGAA 

CTGTCGCACCACCGCCTGAAAGGCAGGTTTCAGAGGGACCGC^ 

CATCCTGGTGCTGACGGACGACCAGGATGTGGAGCnXjGGTTCC^ 

CCCGGCGCATCATGGAGCAGGGCGGGGCXJCACTTCATCAAG^ 

TGCTGCCCCTCACGCTGCTCCATCCTCACrGGCAAGT^ 

CAACAATGAGAACTGCTCCTCGCCCTCCTGGCAGGCACAGCATO 

TGTACCTCAATAGCACTGGCTACCGGACAGCTTTCTTCGGGAAGTA 

GGCTCCTACGTGCCACCCGGCTGGAAGGAGTGGGTCGGACTC 

TAACTACACGCTGTGTCGGAACGGGGTGAAAGAGAAGCACGGCTCCGACTACTCCAAGGAT^ 
ACCTCACAGACCTCATCACCAATGACAGCGTGAGCTTCT^ 
CCGCACAGGCCAGTCCTCATGGTCATCAGCCATGCAGCCCCCCAGGGCCCTG 
CCCACAATATTCACGCCTCTTCCCAAACGCATCTCAGCAC^ 
CGCCCAACCCGGACAAACACTGGATCATGCGCTACACGGGGCCCATGAAGCC<^ 
GAATTCACCAAGATGCTGCAGCGGAAGCGCTTGCAGACCCTCATGTCGG 
GGAGACGATTTACAACATGCTGGTTGAGACGGGCGAGCTGGACAACACGTACATOGTATAC^ 
-GCGCGGAGGAG^T-TACX^GATC^CC^GT-^^ 
GAGTTTGACATCAGGGTCCCGTTCTAOjTGAGGGGCGCCAACGTGGAAGC 
TCCCCACATCGTCCTC^CATTGACCTGGCCCGCACCATCC 

TACCTGCGGATATGGACGGGAAATCCATCCTCAAGCTGCTGGACACGGAGCGGCCXK3TGAAT 
CGGTTTCACTTGAAAAAGAAGATGAGGGTCTGGCXX3GA 

GCTGCTACACAAGAGAGACAATGACAAGGTGGACGCCCAGGAGGAGAACTTTCTGCCCAAGT 
ACCAGCGTGTGAAGGACCTGTGTCAGCGTGCrGAGTACCAGACGGCGTGTC 
CAGAAGTGGCAGTGTGTGGAGGACGCCAGGGGGAAGCTGAA 
CATGCGGCTGGGCGGCAGCAGAGGCCTCTCCAACCTCGTC 

GCGAGGCCTGC^CCTGTGAGAGCGGGGACTACAAGCTCAGCCTGGCCGGACGCCGGAAAAAA 
CTCITCAAGAAGAAGTAGAAGGCGAGGTATGTCCGCAGTCGCTC 

CGAGGTGGACGGCAGGGTGTACCACGTAGGCCTGGGTCATGCC^C^CAGCCCGGAAACGT^ 

CCAAGCGGCACTGGCCAGGGGCCCCTGAGGACCAAGATGACAAGGATGGTGGGG^ 

GGCACTGGAGGCCTTCCCGACTACTGAGGCGCCAACCCCATTAAAGTC 

CATCCTAGAGAAGGACACAGTCCAGTGTGACCTGGACCTGTAGAAGTCCCTGCAGG 

AAGACCACAAGCTGCAC^TCGACCAGGAGATTGAAAGCCrGCAGAACAAAATTAA 

AGGGAAGTCCGAGGTGACCTGAAGAAAAAGCGGCCAGAAGAATGTGACTGTCAGA 

CTACCACACCCAGCACAAAGGCCGCCTCAAGCACAGAGGCTCCAGTCrGCATCCTTTCAGG^ 

AGGGCCTGCAAGAGAAGGACAAGGTGTGGCTGTTGC<3^ 

CGCAAGCTGCTCAAGCGCCTGCAGAACAACGACACGTGCAGCATGCGAGGC 

CACCCAC<3ACAAGGAGCACTGGCAGACGGCXiKZCrTTCT^ 

GCACC^^CGCCAACAATAACAOGTACTGGTGCATGAGGAGCAT 




FIGURE 46B

CTCTTCTGTGAATTTGCAACT6GCTTCCTAGAGTACTTTGATCTCAACACAGACCCCTACCA 
GCTGATGAATGCAGTGAACACACn^ACAGGGATGTCCTCAACCAGCTACACGTACAGCTCA 
TGGAGCTGAGGAGCTGCAAGGGTTACAAGCAGTGTAACCCCCGGACTCGAAACATGGACCTG 
GATGGAGGAAGCTATGAGCAATACAGGCAGTTTCAGCGTCGAAAGTGGCCAGAAATGAAGAG 
ACCTTCTTCCAAATCACTGGGACAACTGTGGGAAGGCTGGGAAGGT2AAGAAACAACAGAGG 
TGGACCTCCAAAAACATAGAGGCATCACCTGACTGCACAGGCAATGAAAAACCATGTGGGTG 
ATTTCCAGCAGACCTGTGCTATTGGCCAGGAGGCCTGAGAAAGCAAGCACGCACTCTCAGTC 
AACATGACAGATTOTGGAGGATAACCAGCAGGAGCAGAGATAACTTCAGGAAGTCCATTTTT 

GCCCCTGCTTTTGCTTTGGATTATACCTCACCAGCTGCACAAA^ 

AAAGTCACCACTAACCCTCCCCCAGAAGCTCACAAAGGAAAACGGAGAGAGCGAGCGAGAGA 
GATTTCCTTGGAAATTTCTCCCAAGGGCGAAAGTCATTGGAATTTTTAAATCATAGGGGAAA 
AGCAGTCCTGTTCTAAATCCTCTTATTCTTTTGGTTTGTCACAAAGAAGGAACTAAGAAGCA 
GGACAGAGGCAACGTGGAGAGGCTGAAAACAGTGCAGAGACGTTTGACAATGAGTCAGTAGC 
ACAAAAGAGATGACATTTACCTAGCACTATAAACCCTGGTTGCCTCTGAAGAAACTGCCTTC 
ATTGTATATATGTGACTATTTACATGTAATCAAC^TGGGAACTTTTAGGGGAACCTAATAAG 
AAATCCCAATTTTCAGGAGTGGTGGTGTCAATAAACGCTCTGTGGCCAGTGTAAAAGAAAAA 
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FIGURE 47 

MGPPSLVLCIjLSATVFSLLGGSSAFLSHHRLKGRFQRDRRNI rpni ilvltddqdvelgsmq 
VMNKTRRIMEQGGAHFINAFVTTPMCCPSRSS ILTGKYVHNHNTYTNNENCSSPSWQAQHES 
RTFAVYLNSTGYRTAFFGKYLNEYNGSYVPPGWKEWVGLLKNSRFYNY 
YSKDYLTDLITNDSVSFFRTSKKMYPHRPVLMVISHAAPHGPEDSA 

SYNYAPNPDKHWIMRYTGPMKP IHMEFTNMLQRKRLQTLMS VDDSMETI YNMLVETGELDNT 
YIVYTADHGYHIGQFGLVKGKSMPYEFDIRVP 

AGLD I PADMDGKS I LKLLDTERPVNRFHLKKKMRVWRDSFLVERGKLLHKRDN^ 
FLPKYQRVKDLCQRAEYQTACEQLGQKWQCVEDATC 

YGQGSEACTCDSGDYKLSLAGRRKKLFKKKYKASYVRSRS IRSVAI EVDGRVYHVGLGDAAQ 

PRNLTKRHWPGAPEDQDDKDGGDFSGTGGLPDYSAANP I KVTHRCY I LENDTVQCDLDLYKS 

LQAWKDHKLHIDHEIETLQNKIKNLREVRGHLKKKRPEECDCHKISra 

HPFRKGLQEKDKVWLLREQKRKKKLRKLLKRLQNND^ 

PFCACTSANNNTYWC3^TINETHNFLFCEFATGFLEYFDLOT 

HVQLMELRSCKGYKQCNPRTRNMDLIX^ 
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FIGURE 48 

AAOVAAGTTCAGTGACTGAGAGGGCTGAGCGGAGGCTGCTGAAGGGGAGAAAGGAGTGAGGA 
GCTGCTGGGCAGAGAGGGACTGTCCGGCTCCCAGA^CTGGGCCTCCTGGGGAGCACAGCCC 
TCGTGGGATGGATCACAGGTGCTGCTGTGGCGGTCCTGCTGCTGCTGCTGCTGCrGGCCACC 
TGCCTTTTCCACGGACGGCAGGACTGTGACGTGGAGAGGAACCGTACAGCTGCAGGGGGAAA 

CCGAGTCCGCCGGGCCCAGCCTTGGCCCTTCCG^ 

ATCACCGTCATCCTGX3CCACGTATCTCATGTGCCGAATGTGGGCCTCCACCACCACCACCAC 
CCCCGCCACACCCCrrCACCACCTCCACCACCACCACCACCCCCACCGCCACCATCCCCGCCA 

CGCTCGCXGJVGGCTGCTGTCGCCGGTGCCTGTGGAC^ 

TTCCCAGGACAAGTGGACCCCATGTTTCCATGTGGAAGGATGCATCTCTGGGGTGAACGAGG 
GGAACAATAGACTGGGGCTTGCTCCAGCTGCATTTGCATGGCATGCCCCAGTGTACTATGGC 
AGC^GAGAATGGAGGAACACTGGGTCTGCAGTGCTGAAGGGTTTGGGGAGTGGAGAGCAAGG 
GTGCTCTTTCGGGGCTGGACAGCCCGTCTTGTGACAGTGACTCCCAGTGAGCCCCAGAAATG 
ACAAGCGTGTCTTGGCAGAGCCIAGCACACAAGTGGATGTGAAGTGCCCGTCTTGACCTCCTC 
ATC^GGCTGCTGC^GGCCTCTGGCGGGCAGGGCACTGGGAGAGGCCCTGAGAATGTCCTTTT 
GGTTTGGAGAAGGCAGTGTGAGGCTGCACAGTCAATTCATCGGTGCCTTAGTCCAAGAAAAT 
AAAAACCACTAAGAAGCTTTAAAAAAAAAAAAAAAAAAAAA 
FIGURE 49

MLGLIXSSTALVGWITGAAVAVIiliLI^TCLFHGRQDCDVERNRTAAGGNRVRRAQPWPFR 
RRGHLG I FHHHRHPGHVSHVPNVGLHHHHHPRHT PHHLHHHHHPHRHHPRHAR 
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FIGURE 50 
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GGCGGCTGCTGAGCTGCCTTGAGGTGCAGTGTTGGGGATCCAGAGCCMS 
CTACTGGGCCTGATTGGGGGCCTGACTCTCT 

GTACTCAGGGCTACTGGCTGGGGTGGAAGTGAGTGCTGGGTCACCCCCCATCCGCAACGT^ 
CTGTGGCCTACAAGTTCCACATGGGGCTCT^ 
TGCAGCATCTCTCCCAAGCTCCGCTCCATC 
CCCTGATAAGTGCCGATGTGCCGTGGGCAGCATC 

CTGAGCTCATCGACCTCTACCAGAAATTTGGCTTCAAGGTGTTCTCCTT^ 
CATGTGGTGACAGCCACCTTCCCCTACA^ 
TOTCCATCCTGCCTTGGACACCTACATC^ 
AGATCTACCAGGAAGACCAGATCCATTTCATGTC 

GTGCCTGAGATGAAGGAGACAGAGTGGAAATGGCGGGGGCTTGTGGAGGCC^TTGACACCC^ 

GGTGGATGGCACAGGAGCTGACACAATGAGTGACACGAGTTCTGTAAGCTT 

CTGGCAGCCGGGAGACITCAGCTGCCACACT 

GACGGTGACACCCGCAGCGAGCACAGCTACAGCGAGTCAGGTGCCAGCGGCTCC^ 
GGAGCTGGACTTGGAGGGCGAGGGGCCCTTAGGGGAGTCACGGCTGGACCCTGGGACTGAGC 

CCCTGGGGACTACOVAGTGGCrCT^ 
ATGGCCTGCACCCTCCTGCAGTGCAGTT^ 

CCAGCCCTCTTCCTCCTTCCTCTGXMGGAGGAGGGGTTCCTGAGGGACCTGACTTCCCCTGC 

TCCAGGCCTCTTGCTAAGCCTTCTCXITCACTGCCCTTTAGGCTCCCAGGGCCAGAG 

GGGACTATTTTCTGCACCAGCCCCCAGGGC 

ACAGTGGAGCTTCCAGGACCCAGAATAAAGCCAATGATTTACTTGTTTCACCT 
AAAAAAAAAA 
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FIGURE 51 

MSDLLLLGLIGGLTLLLLLTLI^ 

LFTESCS I SPKLRS IAVYYDNPHMVPPDKCRCAVGS ILSEGEESPS PELIDLYQKFGFKVFS 
FPAPSHVVTATFPYTTILSIWLATRRVHPALDT^ 
QGDFYVPEMKETEWKWRGLVEAIDTQVDGTGA^ 
SRGWDIX^TRSEHSYSESGASGSSFEELDIiEGE^ 
FIGURE 52

CCGCGGGAACGCTGTCC^^ 

GCCCCGCGCCCAGTC^CCCTGCGCCCCTCACTCCTCCCGCTCCATCTGCTGCTGCTGCT 
GCTGCTC^GTGCGGCGGTGTGCCGGGCrGAGGCTGGGCTCGAAACCGAAAGTCCCGTCCGGA 
CCCTCCy^GTGGAGACCCTGGTGGAGCCCCCAGAACCATGTGCCGAGCCCGCTGCrTTTGGA 

GACACGCTTCACATACACTACACGGG^ 

GACCAGAGACCCTCTGGTTATAGAACTTGGCCAAAAGCAGGTGATTCCAGGTCTGGAGCy^GA 

GTCTTCTCGAO^TGteTGTGGGAGAGAAGCGAAGGGCAATCATTCCTTCTCACTTGGCCTAT 

GGAAAACGGG<^TTTCCACCATCTGTCCCAGCGGATGCAGTGGTGCAGTATGACGTGGAGCT 

GATTGCACTAATCCGAGCCAACTACTGGCTAAAGCTGGTGAAGGGCATTTTGCCTOT 

GGATGGCCATGGTGCCAGCCCTCCTGGGCCTCATTGGGTATCACCTATACAGAAAGGCCAAT 

AGACCCAAAGTCTCCAAAAAGAAGCTCAAGGAAGAGAAACGAAACAAGAGCyU^AAAGAAAT^ 

ATAAATAATAAATTTTAAAAAACTTAAAAAAAAAAAAAAAAAA 



WO 99/63088 PCT/US99/12252

FIGURE 53 

MTLRPSLLPLHLLLLLLTjSAAVCRAEAGLETESPVRTLQVETLVEPPEP 
HYTGSLVDGRI IDTSLTRDPLVI ELGQKQVI PGLEQSLLDMCVGEKRRAI I PSHLAYGKRGF 
P PS VPADAWQYDVEL I AL I RANYWLKL VKG I LP LVGMAMVP ALLGL I G YHLYRKANRPKVS 
KKKLKEEKRNKSKKK 
FIGURE 54

CCCGGGAACGTGTTCCTGGCTGCCGCACCCGAACAGCCTGTCCTGGTGCCCCGGCTCCCTGC 
CCCGCGCCCAGTCATGACCCTGCGCCCCTCACTCCTCCCGCTCCATCTGCTGCTGCTGCTGC 
TGCTCAGTGCGGCGGTGTGCCGGGCTGAGGCTGGGCTCGAAACCGAAAGTCCCGTCCGGACC 

CTCCAAGTGGAGACCCTGGTGGAGCCCCCAGAACCATGTGC^ 

CACGCTTCACATACACTACACGGGAAGCTTGGTAGATGGACGTATTATTGACACCTCCCTGA 
CGAGAGACCCTCTGGTTATAGAACTTGGCCAAAAGCAGGTGATTCCAGGTCTGGAGCAGAGT 
CTTCTCGAC^TGTGTGTGGGAGAGAAGCGAAGGGCy^TCATTCCTTCTCACTTGGCCTATGG 
AAAACGGGGATTTCCACCATCTGTCCCAGCGGATGCAGTGGTGCAGTATGACGTGGAGCTGA 
TTGCACTAATCCGAGCC^CTACTGGCTAAAGCTGGTGAAGGGCATTTTGCCTCTGGTAGGG 
ATGGCCATGGTGCCACCCTCCTGGGCCTCATTGGGTATCACCTATACAGAAAGGCCAATAGA 
CCOVAAGTCTCCAAAAAGAAGCTC^AGGAAGAGAAACGAAACAAGAGCAAAAAGAAATAATA 



AATAATAAATTTTAAAAAACTTA 
FIGURE 55

CCGAAAGTCCCGTCCGGACCCTCCAAGTGGAGACCC^ 

GAGCCCGCTGCTTTTGGAGACACXjCTTCACATACACTA 

TATTATTGACACCTCCCTGACC^GAGACCCT^ 

TTCC^GGTCTGGAGCAGAGTCTTCT^ 

CCTTCTCACTTGGCCTATGGAAAACGGGGA 

GCAGTATGACGTGGAGCTGATTGCACTAATCCGAGCCA^ 

GCATTTTGCCTCTGGTAGGGATGGCCATGGTGCCAGCCCTCCTGGGCCT 

CTATACAGAAAGGCCAATAGACCCAAAGTCTCCAAAAAGAAGCT 

C^GAGCAAAAAGAAATAATAAATAATAAATTTTAAAAAACn^AA^ 
FIGURE 56

CTGCTGC^TCCGGGTGTCTGGAGGCTGTGGCCGTTTTGTTTTCTTGGCTAAAATCGGGGGAG 
TGAGGCGGGCCGGCGCGGCGCGACACCGGGCTCCGGAACCACTGCACGACGGGGCTGGACTG 
ACCTGAAAAAAA^TCTGGATTTCTAGAGGGCTTGAGATGCTCAGAATGCATTGACTGGGGG 

GAAAAGCGCAATACTATTGCTTCCATTGCTGCTGGTGTAOTATTTTTTACA 
TATCA.TAQAT6CAGCTGTTATTTATCCCACCATGAAAGATTTCAACCACTCATACCATGCCT 
GTGGTGTTATAGCAACCATAGCCTTCCTAATGATTAATGCAGTATCGAATGGACAAGTCCGA 
GGTGATAGTTACAGTGAAGGTTGTCTGGGTCAAACAGGTGCTCGCATTTGGCTT^ 

TTTCATGTTGGCCTTTGGATCTCTGATTGCATCTATGTGGATTCTTTTTGGA 
CTAAAGAAAAAGACATAGTATACCCTGGAATTGCTGTATTTTTCCAGAATGCCTTCATCTTT 

TTTGGAGGGCTGGTTTTTAAGTTTGGCCGCACTGAAGACTTATGGCAGTGAACACATCTGAT 
TTCCCACAGCACAACAGCCCTGCATGGGTTTGTTTGTTTTm 

TTGTAATGCCATTTTCTAAACTTATTTCTGAGTGTAGTCTCAGCTTAAAGTTGTGTAATACT 
AAAATCACGAGAACACCTAAACAACAACCAAAAATCTATTGTGGTATGCACTTGATTAACTT 
ATAAAATGTTAGAGGAAACTTTCACATGAATAATTTTTGTCAAATTTTATCATGGTATAATT 
TGTAAAAATAAAAAGAAATTACAAAAGAAATTATGGATTTGTCAATGTAAGTATTTGTCATA 
TCTGAGGTCCAAAACCACAATGAAAGTGCTCTGAAGATTTAATGTGTTTATTCAAATGTGGT 
CTCTTCTGTGTCAAATGTTAAATGAAATATAAACATTTTTTAGTTTTTAAAATATTCCGTGG 
TCAAAATTCTTCCTCACTATAATTGGTATTTACTTTTACCAAAAATTCTGTGAACATGTAAT 
GTAACTGGCTTTTGAGGGTCTCCCAAGGGGTGAGTGGACGTGTTGGAAGAGAGAAGCACCAT 
GGTCCAGCCACCAGGCTCCCTGTGTCCCTTCCATGGGAAGGTCTTCCGCTGTGCCTCTCATT 
CCAAGGGCAGGAAGATGTGACTCAGCCATGACACGTGGTTCTGGTGGGATGCACAGTCACTC 



CACATCCACCACTG 




FIGURE 57 

MSGFLEGLRCSECIDWGEKRNTIASI AAGVLFFTGWWI I IDAAVI YPTMKDFNHSYHACGVT 

ATIAFLMINAVSNGQVRGDSYSEGOjGQTGARIWLFVGFMLAFGSLIASI^ 

DIVYPGIAVFFQNAFIFFGGLVFKFGRTEDLWQ 
FIGURE 58



TTCTTGGCTAAAATCGGGGGAGTGAGGCGGGCCGGCGCGGCGCGACACCGGGCTCCGGAACC 
ACTGCACGACGGGGCTGGACTGACCTGAAAAAAATGTCrGGATTTCTAGAGGGCTTGAGATG 

CTCAGAATGCATTGACTGGGGGGAAAAGCGCAATACTATTGCTTCCATTC 

TATTTTTTACAGGCTGGTGGATTAT^^ 

TTQ^CCACTCATACCATGCCTGTGGTGTTATAGCAACCATAGCCTTCCTAATGATTAATGC 
AGTATCGAATGGACAAGTCCGAGGTGATAGTTACAGTGAAGGTTGTCTGGGTCAAACAGGTG 
CTCGC^TTTGGCTTTTCGTTGGTTTCATGTTGGCCTTTGGATCTCTGATTGCATCTATGTGG 
ATTCTTTTTGGAGGTTATGTTGCTAAAGAAAAAGACATAGTATACCCTGGAATTGCTGTATT 
TTTCCAGAATGCCITCATCTTTTTTGGAGGGCTGGTrrrrAAGTTTGGC 
FIGURE 59






TGGACGGACCTGAAAAAAATGTTTGGATTTOTAGAGGGOTTGAGATGTTCAGAATGCATGAC 
TGGGGGAAAAGCGC^VATACTATTGCTTCGATTGCTC 

GTGGATTATCATAGATGCAGNTGTTATTTATCCCACCATGAAAGATTTCAACC^NTCATACC 

ATGCCTGTGGTGTTATAGCAACCATAGCCTTCin'AATGATTAATGGAGTATCGAATGGACAA 

GTCCGAGGTGATAGTTACAGTGAAGGTTGTTTGGGTCAAACAGGTGCT 

CGTTGGTTTC^TGTTGGCCTTTGGATCTCTGATTGCA 

ATGTTGCTAAAGAAAAAGACATAGTATACCCTGGAATTGN^ 

ATCTTTTTTGGAGGGCTGGTTTTTAAGTTTGGCC3CA 
FIGURE 60

GGACACCGGGTTCCG<^CCAATGCANGACGGGGTGGANTGACCTGAAAAAAATGTTTGGATT 
TTTAGAGGGCTTGAGATGNTCAGAATGGATTGACTGGGGGAAAAGCGCAATANTATTGCTTT 
CCATTGCTGCTGGTGTACTATTTTTTACAGGGTGGTGGATTATCmTAGATGC^GCroTTATT 
TATCCCACCATGAAAGATTTNAACCACTCATACCATGCCTGTGGTGTTATAGCAACCA.TAGC 
CTTCCTAATGATTAATGCAGTATCGAATGGACAAGTCCGAGGTGATAGTTACAGTGAAGGTT 

GTTTGGK3TCAAACAGGTGNTCGC^TTTGGCTTTTCGTTGGTT^ 

CTGATTGNATTCTATGCGGATTCTTCTTGGAGGTTATGTTGCTAAAGAAAAAGACATAGTAT 
ACCCrGGAATTNCTNTATTTTTCCAGAATGCC 
FIGURE 61

TAGAGGGCTTGAGATGCTCA^ 

ATTGNTGNTGGTGTANTATTTTTTTACAGGCTGGTGGAT 

ATCCCACCATGAAAGATTTNAACXIANTCATACCATGCCTGTGGTC 

TTCCTAATGATTAATGCAGTATNGAATGGACAA 

TTTGGGTCAAAGAGGTGNTNGCATTTGGCTTTTO 

TGATTGCATTTATGTGGATTNTTTTTGGAGGTTATGTTGCTAAAGNA 

CCTGT 
FIGURE 62

GGGAGGCTGTGNCCGTTTTGTTTTNTTGGCTAAAATCGGGGGAGTGAGGCGGCCCGGCGCGG 

CGNGACACCGGGTTCCGGGAACCATTGCACGACGGGGTGGACTGACCTGAAAAAAATGTTTG 

GATTTNTAGAGGGCTTGAGATGCTC^GAATGCATTGACTGGGGGGAAAAGCGCAATACTATT 

GCTTCCATTGCTGCIK3GTGTACTATTTTTTACAGGCTGGTGGATTATCATAGATGCAGCTGT 

TATTTATCCCACCATGAAAGATTTC^CCACTCATACCATGCCTGTGGTGTTATAGC^ACCA 

TAGCCTTCCTAATGATTAATGCAGTATCGAATGGACAAGTCCGAGGTGATAGTTACAGTGAA 

GGTTGTCTGGGTCAAACAGGTGCTCGCATTTGGCTTTTCGTTGGTTTCATGTTGGCCT 

ATNTCTGATTGCATCTATGTGGATTCTTTTTGGAGGTTATGTTGCTAAAGAAAAAGACATAG 

TATACCCTGGAATTGCTGTATTTTTCCAGAATGCCTTCATNTTTTTTGGAGGGCTG 



FIGURE 63 



CGACGCCGGCGTGATGTGGCTTCCGCTGGTGCTGCTCCTGGC^ 

TCTGCAAAGTTTACTTGGGACTATTCTCTGGCAGCTCCCC 

AAACGGCCCCCAGCGCCCCTGGTAACTGAC^ 

TTTTTCAGCCAACCAAGTGCCGGAGAAGCTGGATGTGGTGGTAAT^ 

GCCTGGCTGCAGCTGCAATTCTAGCTAAAGCTGGCAAGCGAGTCCT 

ACCAAGGCAGGGGGCTGCTGTCATACCTTTGGAAAGAATGGCCTTGAATTTGAC^ 

CC^TTACATTGGGCGTATGGAAGAGGGCAGCATTGGCCGTTTTATCTTG 

AAGGGCAGCTGGACTGGGCTCCCCTGTCCTCTCCTTTTGACATCATGGTACTGGAAGGGCCC 

AATGGCCGAAAGGAGTACCCCATGTACAGTGGAGAGAAAGCCTACATTCAGGGCCT 

GAAGTTTCCACAGGAGGAAGCTATCATTGACAAGTATATA^ 

GTGGAGCCCCTCATGCCATCCTGTTGAAATTCCTCCCATTC 

AGGTGTGGGCTGCTGACTCGTTTCTCTCCATTCCTTCAAG^^ 

GGTCCTGCAGCAGCTGGGGGCCTCCTCTGAGCTCCAGGCAGTACT 

CTTACGGTGTCACCCCC^CCACAGTGCCTTTTCCATGCACGCCCTGCTGGTCA 

ATGAAAGGAGGCTTTTATCCCCGAGGGGGTTCCAGTGAAATTGCCTTCCACACCATCCCTGT 

GATTGAGCGGGCTGGGGGCGCTGTCCTCACAAAGGCCACTG 

CAGCTGGGAAAGCCTGTGGTGTCAGTGTGAAGAAGGGGCATGAGCrGGTC 

CCC^TCGTGGTCTCCAACGCAGGACTGTTCAACACCT 

CCGCTGCCTGCCAGGTGTGAAGCAGC^CTGGGGACGGTGCGGCCCGGCTTAGGCATGACCT 
CTGTTTTC^TCTGCCTGCGAGGCACCAAGGAAGACCTGCATCTGCCGTCCACCAAC^ 
GTTTACTATGACACGGACATGGACCAGGCGATGGAGCGCTACGTCTCCATC 
GGCTGCGGAACAC^TCCCTCTTCTCTTCTTCGCTTTCCCA 

AGGACCGATTCGCAGGCCGGTCCACCATGATCATGCTCATACCCACTGCCTACGAGTGGTTT 
GAGGAGTGGCAGGCGGAGCTGAAGGGAAAGCGGGGCAGTGACTATGAGACCTTCAAAAACTC 
CTTTGTGGAAGCCTCTATGTCAGTGGTCCTGAAACTGTTCCCACAGCTGGAGGGGAAGGTGG 
AGAGTGTGACTGCAGGATCCCCACTCACCAACCAGTTCTATCTGGCTGCTCCCCGAGGTGCC 
TGCTACGGGGCTGACCATGACCTGGGCQ3CCTGCACCCTTGTGTGATGGCCTCCTTGAGGGC 
CCAGAGCCCCATCCCCAACCTCTATCrGACAGGCCAGGATATCTT 

GGGCCCTGCAAGGTGCCCTGCTGTGCAGCAGCGCCATCCTGAAGCGGAACTTGTACTCAGAC 
CTTAAGAATCTTGATTCTAGGATCCGGGCACAGAAGAAAAAGAATT^ 

AGTCAGAGGAATTTGCCCAATGGCTGGGGCATCTCCCTTGACTTACCCATAATGTCTTTCTG 
ncATTAGTTCCTTGCACGTATAAAGCACTCTAATTTGGTTCTGATGCCTGAAGAGAGGCCTAG 
TTTAAATCACAATTCCGAATCTGGGGCAATGGAA 

TCTTTACGCCTTTTATAACATGCCATCCXrTACTAATAGGATATTGACrTGGATAGCTTC 

TCTCATGACGAGCXXKZGCTCTGCATCCCTCACCCATGCCTCCTT^ 

ATATTCCATCTGTGKSATAGAACCCCTGGCAGTGTTG^ 

TGTCCTGAGGCTTCTGCTCTCATTCATTTAGTGCTACGCTGCA<^ 

GAAAAGGGAGACTAATGAGGCTTAACTCAAAACCTGGGCXjT 

GGTTTGGAGAGCTCTAGATCTCTTTTGTGCTGGGTTCAGT 

GCCTGTGTCTGGCCAGTGTGGTTCTGGAGCTTTGGGGTAACAGCAGGATCCATCAGTTAGTA 
GGGTGCATGTCAGATGATCATATCG^TTG^ 

TCGGGGTGGCAGCTGGTTCTCAATGTGCCAGCAGGGACTCAGTACCTGAG^ 
CTTATCCACCAAATACACAGGGAAGGGTGATGCAGGGAAGGGTGACATCAGGAGT 
TGGACTGGTAAGATGAATACTTTGCTGGGCTGAAGCAG^ 
CACAGCAGGGGACAGTGCAGGGAGGTGTGGGGTAAGGGAGGGAA 

AAGCCACGGAATGTGTGTGAAGCCCAGAAATGGCATTTGCAGTTAATTAGCACATGTGAGTC 
TTAGACAGGTAGGTGAATGCAAGCTCAAGGTTTGGAAAAATGACTTTTCAGTTATGTCTTTG 
GTATCAGACATACGAAAGGTCTCTTTGTAGTTCGTGT 
ATTCCATTGCTTTAAAAAAAAAAAAAAA 




FIGURE 64

mwlplvllijwlllavlckvyi^ 

QVPEKLDVWIGSGFGGLAAAAILAKAGKRVLVLEQHTKAGGCCHTFGKNGLEFDTGIHYIG 

RMEEGSIGRFILDQITEGQLDWAPLSSPFDIMVLEGPNGRKEYPMYSGEKAYIQGLKBKFPQ 

EEAIIDKYIKLVKWSSGAPHAILLKFLPLPWQLLDRCX3LLTRFSPFLQASTQSLAEVLQQ 

LGASSELQAVLSYIFPTYGVTPNHSAFSMHALLVNHYMKGGFYPRGGSSEIAFHTIPVIQRA 

GGAVLTKATVQSVLLDSAGKACGVSVKKGHELVNIYCPIWSNAGLFNTYEHLLPGNARCL 

GVKQQLGTVRPGLGMTSVFICLRGTKEDLHLPSTNYYVYYDTDMDQAMERYVSMPREEAAEH 

I PLLFFAFPSAKDPTWEDRFPGRSTMIMLI PTAYEWFEEWQAELKGKRGSDYETFKNSFVEA 

SMSVVLKLFPQLEGKVESVTAGSPLTNQFYLAAPRGACyGADHDLGRLHPCVMASLRAQS P I 

PNLYLTGQDIFTCGLVGALQGALLCSSAILKRNLYSDLKNLDSRIRAQKKKN 
FIGURE 65

GCAGCGGCGAGGCGGCGGTGGTGGCTGAGTCCGT^ 

GGGGTTGGCACCGGCCCCGAGAGGAGGi^CGG^ 

TGCGGTGCTGCTGAGCTTGGCCTCGGCX3TCCTCGGATC 

TAGATTCCAAGACTACTTTGACATCAGATGAGTC^ 

GTAGTTGCTGGTCAAATATTTCTTGATTCAG 

AGAGGAAGACAGCCTCAAGAGCCAAGAGGGGGAAAGTGTCACAGAAGATATCA 

AGTCTCCAAATCCAGAAAACAAGGACTATGAAGAGCCAA^ 

ACCGCCATTGAAGGGACAGCACATGGGX3AGCC 

GGAGTATGATGAATGTACATCAGATGGGAGGGAAGATGGCAGACTGTGGTGTGCTA 
ATGACTACAAAGCAGATGAAAAGTGGGGCTTTTGTGAAACTGAAGAAGAGGCTC 
CGGCAGATGCAGGAAGCAGAAATGATGTATCAAACTGGAATGAAAATCCT 
TAAGAAAAGCCAAAAAAGAGAAGCATATCGGTATCTCC^ 

CCAAAGCCCTGGAGAGAGTGTCATATGCTCTTTTATTTGGTGATTACTTGCCACAGAATATC 

CAGGCAGCGAGAGAGATGTTTGAGAAGCTGACTGAGGAAGGCTCTCCCAAGGGACA 

TCTTGGCTTTCTGTATGCCTCTGG 

ATTATACATTTGGAGCTCTTGGGGGCAATCT^ 

T^TGGAAGGCTAATAATATTAACATCAGAAGAATT^ 

TGAGCTTTCATGATCCAGATTTGCTTGT^ 

ATTCTTGTTAATGGATATAACACATGGAATCTAC^ 

ATTTTTCTTTAAAATGATTAGTTTGGCTGATTGCCCCTAAAAAGAGAGATCTGATAAATGGC 

TCTTTTTAAkTTTTCTCTGAGTTGGAATTGTC^ 

TTTTAAAAATTTTTCTTTAGTTTTTCAA^ 

GA^TATTATACAATATTTTGCAACAATGCCCTAAGAATTGTTAAAATTCATGGA 

GTGCAGAATGACTCCAGAGAGCTCTACTTTCTGTTTTTTACTTTTCATGATTGGCTGTCTT^ 

CCATTTATTCTGGTCATTTATTGCTAGTGA 

CTATTTTGCTAATTTGTTACTTTTTCTTTGCTAATTTGGAAGATrAACTCA 

ATTATGTCTAAGATTAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAA 

AAAAAAAAAAAAAAAAAAAAAAAAAAA 
FIGURE 66

MRWIGLTLLLCAVLLSLASASSDEEGSQDESLDSKTTLTSDESVKDHTTAGRVVAGQIFLD 
SEESELESSIQEEEDSLKSQEGESVTEDISFLESPNPENKDYEEPKKVRKPALTAIEGTAHG 
EPCHFPFLFLDKEYDECTSDGREDGRLWCATTYDYKADEKWGFCETEEEAAKRRQMQEAEMM 
YQTGMKILNGSNKKSQKREAYRYLQKAASMNHTKALERVSYALLFGDYLPQNIQAAREMFEK 
LTEEGSPKGQTALGFLYASGLGVNSSQAKALVYYTFGALGGNLIAHMVLVSRL 
FIGURE 67



CTTCCCAGCCCT6TGCCCCAAAGCACCTGGAGCATATAGCCTTGCAGAACTTCTACTTGCCT 
GCCTCCCTGCCTCTGGCCM^CCTGCOSGTGCCTCAGCrTCCTTCTGATGGGGACCrTCCT 
GTCAGTTTCCC^GACAGTCCTGGCCCAGCTGGATGCACTGCTGGTCTTCCCAGGC<^AGTGG 
CTCAACTCTCCTGCACGCTCAGCCCCCAGCACGTCACCATCAGGGACrACHMTGTGTCCTGKS 
TACCAGCAGCGGGCAGGCAGTGCCCCTCGATATCTCCTCTACTACCGCTCGGAGGAGGATCA 
CCACCGGCCTGCTGACATCCCCGATCX3ATTCTCGGCAGCCAAGGATGAGGCCCACAATGCCT 
GTGTCCTCACCATTAGTCCTOTGCAGCCTGAAGACGATCCGGATTACTACTGCT 

TACXMCTTTAGTCCCl^OGGGTGGGGTGTGAGATGGGTGCCTCCCCTCTGCCrc 

GCCCCTGACCTTC^GTCCCTTTTAAACTTTCTC^ 

TTAATAATATTCAACATGTCAACAAC 
FIGURE 68

MACRCLSFLI^GTFLSVSQTVLAQLDALLVFPGQVAQLSCTLSPQHVTIRDYGVSWYQQRAG 
SAPRYLLYYRSEEDHHRPADIPDRFSAAKDEAHNACVLTISPVQPEDDADYYCSVGYGFSP 
FIGURE 69

GCCGCCCCGCCCCGAGACCGGGCCCGGGGGCGCGGGGCGGCGGGATGCGGCGCC03GGGCGG 

CGATGACCGCGGAGCGCACGCCGCGGGCCCGGCCCTGACCCO*^^ 

CCCGCCGAGGTCCGGACAGGCCGAGAT^CGCCGAG 

CGCTGCTGCTGGGGGCCTTCCCACCGGCCGCCGCCGCCCGAGGCCCC^ 

AAGGTGGTCCCACGGCAGGTGGCCCGGCTGGGCCGCACTC 

GGGGGACCCGCCGCCGCTGACCATGTGGACCAAGGATGGCCGCACCATC 

GCCGCTTCCGCGTGCTGCCGCAGGGGCTGAAGGTGAAGCAGGTGGAGCGGGAGGATG^ 

GTGTACGTGTGCAAGGCCACCAAGGGCTTCGGCAGCCTGAGCGTCAACTACACCCTCGTCGT 

GCTGGATGACATTAGCCCAGGGAAGGAGAGCCTGGGGCCCGACAGCTCCTCTX3GGG 

AGGACCCCGCCAGCCAGCAGTGGGCACGACCGCGCTTCACACAGCCCTCCAAGATGAGGCGC 

CGGGTGATCGCACGGCCCGTGGGTAGCTCCGTGCGGCTCAAGTGCGTGGCCAGCGGGCACCC 

TCGGCCCGACATCACGTGGATGAAGGACGACC^GGCCTTGACGCGCCCAGAGGCCGCTGAGC 

CCAGGAAG AAGAAGTGGAGACTG AGCCTGAAG AACCTG C£K3^ 

ACCTGCCGCGTGTCGAACCGCGCGGGCGCCATCAACGCCACCTACAAGGTGGATGTGATCCA 

GCGGACCCGTTCCAAGCCCGTGCTCACAGGCACGCACCCCGTGAACACGACGGTGGACITCJG 

GGGGGACCACGTCCTTCCAGTGCAAGGTGCGCAGCGACGTGAAGCCGGTGATCCAGTGGCTG 

AAGCGCX3TGGAGTACX3GCGCCGAGGGCCGCGACAACTCGACGATCGATC 

GTTTGTGGTGCTGCCCACGGGTGACGTGTGGTCGCGGCCCGACGGCTCCTACCTCAATAAGC 

TGCTCATCACCCGTGCCCGCCAGGACGATGCGGGCATGTACATCTGCCTTGGCGCCAACACC 

ATGGGCTACAGCTTCCGCAGCGCCTTCCTCACCGTGCTGCCAGACCCAAAACCGCCAGGGCC 

ACCTGTGGCCTCCTCGTCCTCGGCCACTAGC^^ 

CCGGCGCTGTCTTCATCCTGGGCACCCTGCTCCTGTGGCTTTGCCAGGCCCAGAAGAAGCCG 

TGCACCCCCGCGCCTGCCCCTCCCCTGCCTGGGCACCGCCCGCCGGGGACGGCCCGCGACCG 

CAGCGGAGACAAGGACCTTCCCTCGTTGGCCGCCCTCAGCGCTGGCCCTGGTGTGGGGCTGT 

GTGAGGAGCATGGGTCTCCGGCAGCCCCCCAGCACTTACT 

CCTAAGTTGTACCCCAAACTCTACACAGACATCCACACACACACACACAC^ 

AGACTCACACGTGGAGGGCAAGGTCCACCAGCACATCCACTATCAGTG CTAGA CGGCACCGT 

ATCTGCAGTGGGCACGGGGGGGCCGGCCAGACAGGCAGACTGGGAGGATGGAGGACGGAGOT 

GCAGACGAAGGCAGGGGACCCATGGCX3AGGAGGAATGGCCAGCACCCCAGGCAGT 

TGAGGCATAGCCCCTGGACACACACACACAGACACAC^ 

ACACATGCGCGCACACGTGCTCCCTGAAGGCACACGTACGCACACGCACATGCACAGATATG 
CCGCCTGGGCACAC^GATAAGCTGCCCAAATGCACGCACACGCACAGAGACATGCCAGAACA 
TACAAGGACATGCTGCCTGAACATAC^ 

CACAC^C^CTVCACGGATATGCTGTCTGGACGCACACACGTGCAGATATGGTATC 

CACGTGCAC^GATATGCTGCCTGGAGACACAGATAATGCTGC 

ATATTGCCTGGACACACACACACACACACGCGTG^^ 

ACATGCAGATATGCTGCCTGGACACACACTTCCAGACACACGTGCAC^GGCGCAGATATGCT 

GCCTGGACACACGCAGATATGCTGTCTAGTCACACAC 

ACACACGCATGCACAGATATGCTGTCCGGACACACACA 

ACACACAGAGATAATGCTGCCTCAACACTCACACACGTGCAGATATTGCCT 

TGTGGACAGATATGCTGTCTGGACA/TCCACACAC^ 

CACGCACACATGCAGATATGCTGCCTGK3GGACACACTTCCGGACACACATC 

GCAGATATGCTGCCTGGACACACACACAGATAATGCTGCCTGAACACT 

TATTGCCTGGACACACACATGTGCACAGATATGCTGTGTGGACATGCACACACGTGCAGATA 

TGCTGTCCGGATACACACGCACGCACACATGCAGATATGCTGCCTGGGCACACACTTCCGGA 

CACACATGCACACACAGGTGCAGATATGCTGCCTGGAC^ 

GAGGGTGTGCCGTGAAGCCTGCAGTACGTGTGCCGTGAGGCTCATAGTTGATGAGGGACTTT 
CCCTGCTCCACCGTCACTCCGCCAACTCTGTC^ 

CATCCCCGCCTCTGTCCCCTGGCCTTGGCGGCTATTTTTGCCACCTGCCTTGGGTGCCCAGG 

AGTCCCCTACTGCTGTGGGCTGGGGTTGGGGGCACAGCAGCCCCAAGCC^ 

CCCATGGCTAGTGGCTCATCCCCAGTGCATTCTCCCCCTGACACAGAGAAGGGGCCTTGGTA 

TTTATATTTAAGAAATGAAGATAATATTAATAATGATGGAAGGAAGACTGGGTTGC^ 

TGTGGTCTCTCCTGGGGCCGGGGACCCGCCTGGT^ 

GTCCAGGCCAGACACCACCCCCCACCCCACTGTCX3T(^ 

TGTAGAGTTTGAGCTGAAGCCCCGTATATTTAATTTATTTTGTTAAACAC 
FIGURE 70

MT p S p L I^LPPLLLGAFPPAAAARGPPKMAD^ 

yreKDGRTIHSGWSRFRVLPQGLK^QVEREDAGVYVCKATNGPGSLSVNYTLVVLDDISPGK 
ESLGPDSSSGGQEDPASQQWARPRFTQPSKMRRRVIARPVGSSVRLKCVASGHPRPDITWMK 
DIX2ALTRPEAAEPRKKKWTLSLKNLRPEDSGKYTCRVSNRAGAINATYKVDVIQRTRSKPVL 
TGTHPVNTTVDFGGTTSFQCKVRSDVKPVIQi^KRVEYGAEGRHNSTIDVGGQKFVVLPTGD 
WSRPDGSYLNKLLITRARQDDAGMYICLGAinMGYSPRSAFLTVLPDPKPPGPPVASSSSA 
TSLPWWIGIPAGAVFILGTLLLWLCQAQKKPCTPAPAPPLPGHRPPGTARDRSGDKDLPS 
LAALSAGPGVGLCEEHGSPAAPQHLLGPGPVAGPKLYPKLYTDIHTHTHTHSHTHSHVEGKV 

HQHIHYQC 
FIGURE 71

CCCAGCTGAGGAG^CCTGOTCAAGA 

CATTCCAGAGTCAGTGACTCTGTGAAGCACCCACATCTACCTCTTG^ 

TTGGGGGAAAGAT^TGGGGACCAAGGCCTGGGTGTTCTCCTTCCTO 

TCTGTGTTGGGGAGACAGACGATGCTCACCCAGTCAGTAAGAAGAGTCCAGCCTG 

GAACCCCAGCATCTTTGCCAAGCCTGCCGACACCCTGGAGAGCCCTGGTGAGTC 

GGTTCAAGATCGACTAGCCAGGCGGGAAGGGCGACTATGAGCGGCT^^ 

TACTATGGGGACCX3TGTATGTGCCCGTCCCCr<3CGGCTAGAGGCTCGGAC^ 

ACCTGCGGGCAGCACTGGCCAGGTGGTCCATGGTAGTCCCCGTGAGGGTTTCTGGTGCCTCA 

ACAGGGAGCAGCGGCCTGGCCAGAACTGCTCTAATTACACCGT^ 

GGATCCCTGCGCCGAGACACAGAGCGCATCTGGAGCCCATGGTCTCCCTGGAGCAA 

AGCTGCCTGTGGTCAGACTGGGGTCCAGACTCGCACAGSCATTTGC^ 

CGCTGTGCAGTGAGGCCAGCGAAGAGGGTCAGCACTGCATGGGCCAGGACTGTAC^ 

GACCTGACCTGCCCAATOGGCCAGGTGAATGCTGACTGTGATGCCTGCATGTC 

CATGCTTCATGGGGCTGTCTCCCTTCCCGGAGGTGCCCCAGCCT 

TCCTGACCAAGACGCCGAAGCTGCTGACCCAGACAGACAGTGATGGGAGATTCCGAATCCCT 

GGCTTGTGCCCTGATGGCAAAAGCATCCTGAAGATCACAAAGGTCAAGT 

ACTGACAATGCCCAAGACTAGCCTGAAGGCAGCCACCATCAAGGCAGAGT^ 

AGACTCCATACATGGTGATGAACCCTGAGACAAAAGCACGGAGAGCTGGGCAGAGCGTGTCT 

CTGTGCTGTAAGGCCACAGGGAAGCCCAGGCCAGACAA 

ATTGCTGGATCCTTCCCTCTACAAGCATGAGAGCAAGCTGGTGCTGAGGAAACTGCAGCAGC 

ACCAGGOTGGGGAGTACITTTGCAAGGCCCAGAGTGATGCTGGGGCTG^ 

GCCCAGCTGATTGTCACAGCATCTGATGAGACTCCTTGCAACCCAGTTCCTGAGAGCT 

TATCCGGCTGCCCCATGATTGCTTTCAGAATGCCACCAACTCCTTCTACTATGACGTGGGAC 

GCTGCCCTGTTAAGACTTGTGCAGGGCACCAGGATAATGGG 

CAGAACTGCTGTGGCATCTCCAAGACAGAGGAAAGGGAGATCCAGTGCAGTGGCTACACGOT 
ACCCACCAAGGTGGCCAAGGAGTGCAGCTGCCAGCGGTGTACGGAAACTCGGAGCATCGTGC 
GGGGCCGTGTCAGTGCTGCTGACAATGGGGAGCCCATGCGCTTTGGCCATGTGTACA 
AACAGCCGTGTAAGCATGACTGGCTACAAGGGCACTTTCACCCTCCATGTCCCCGAGGACAC 
TGAGAGGCTGGTGCTCACATTTGTGGACAGGCTGCAGAAGTTTGTCAACACCACCAAAGTGC 
TACCTTTCAACAAGAAGGGGAGTGCCGTGTTCCATGAAATCAAGATGCTTOT 
CCCATC^CTTTGGAAGCCATGGAGACCAACATCATCCCCCTGGGGGAAGTGGTTGGTC 
— GGGGATGGGTGAAGTGGAGATTGGATGGAGGAGT-T-TC 

TAGGAAAAGTGAAGGCCAGTGTGAGCTTCCTGGATCCCGGGAATATTTCCACAGCCACAGCT 

GCCCAGACTGACCTCAACTTCATCAATGACGAAGGAGA 

CATGTTCTCTGTGGACTTCAGAGATGAGGTCACCTCAGAGCCA^^ 

AGGTCCACCTTGACTCGACCCAGGTCAAGATGCCAGAGCACATATCCACAGTGAAACTCTGG 
TCACTCAATCCAGACACAGGGCTGTGGGAGGAGGAAGGTGATTTCAAATTTGAAAATGAAAG 
GAGGAACAAAAGAGAAGACAGAACCTTCCTGGTGGGCA 

TCTTTAACCTGGATGTTCCTGAAAGCAGGCGGTGCTTTGTTAAGGTGAGGGCCTACCGGAGT 
GAGAGGTTCTTGCCTAGTGAGCAGATCC^GGGGGTTGTGATC^ 

TAGAACTGGCTTCTTGTCCAACCCTAGGGCCTGGGGCCGCTTTGACAGTGTCATCACAGGCC 
CCAACGGGGCCTGTGTGCCTGCCTTCTGTGATGACCAGTCCCCTGATGCCTACTCTGCCTAT 
GTCTTGGCAAGCXrrGGCTGGGGAGGAACTGCAAGCAGTG 
AAATGCAATTGGCGTCCCTCAGCCCTATCTCAACAAGCTCAACTA 

AGGATCCACGGGTTAAAAAGACAGCTTTGCAGATTAGCATGGCCi\AGCCAAGGCCCAACTCA 
GCTGAGGAGAGCAATGGGCCCATCTATGCX?TTTGAGAACCrCCGGG 

ACCCAGTGCAGCCCACTTCGGGTTCTACCAGATTGAGGGGGATCGATATGACTACAACACAG 

TGCCCTTCAACGAAGATGACCCTATGAGGTGGACTGAAGACTATCTGGCATGGTGGCCAAAG 

CCGATGGAATTCAGGGCCTGCTATATCAAGGTGAAGATTGTGGGGCCACTGGAAGTGAATGT 

GCGATCCCGCAACATGGGGGGCACTCATCXX3CGGACAGTCGGGAAGCTGTATGGAATC<X3 

ATGTGAGGAGCACTCGGGACAGGGACCAGCCCAATGTCTGAGCTGCCTGTCTC 

TGCAGTGGGATGCTCTATGATCAGGACGGTGTGGAGCGCACCCTGGTGAAGGTCATCGGCGA 

GGGCAGCTGCCGTCGAGCCAGTGTGAACCCCATGCTGCATGAGTACCTGGTCAACC^ 

CACTTGCAGTCAACAACGACACCAGTGAGTAGACCATGCTGGGACCCTTGGACCCACT 

CACAACTATGGCATCTACACTGTCACrGACCAGGACCCrCX3 

CGGCCGG1K3CTTTGATGGCACATCG<3ATGGCTCCTCCAGAATCATGAAGAGC^ 

TAGCCCTCACCTTC^CTGTGTAGAGAGGCAAGTAGGCCK^CA 

CAAAGCACCCCAGCCCAGTCCCCTGCTGCAGGCACTGTCCAAGGAAGAGTGCCCTCGAGGAG 
<3CAGCAGCGAGCGAGCAGGGGTGGCCAGCGCCAGGGTGGAGTGGTC 
CTAGAGTTGCTC^(^GCCCCTGATCAAC^GTTTTGTGGTACTTCAC^TCTTCTCCCCT 

Sm^WGACAGCCATTGTGAGACTGATGC^CAAACTGTCACTTGGTTAA 

™^^ CGT ^^ GCTTGa ^ TTTCTTC ATGCCTTTACTTACTT^^^CTA 

^GA^GGCACGTGGCCCCCACAATGGCACAATAAAGCCCCTTTGTGAAACTGTTCTTTAAA 

TGA^^AAGAAATTGGCCACTGGTAAAACTCTGCAGCTTCAACTGTACTTCATTTAATC 

STTA^GCAAAmkcrrTCCTCTTCTTTTTGCATGGTTTTGCCCAC 

Stctcatg^aagatcaaataaccaa^ 

^TAGGCAAGCCTTGATCATAGTTCATACATATAAATGGTGGTGAAATAAAGAAATAAAACA 

MAC^AC^GAAATGTAAATAACTTATTTATTTCTTTGCTAAATTTGGAATO 

GC^CAT^CAAAGTTAAGCTATTAAATATAGGGTGATCATAGTTC 

gaa^tctcctggtatccacaattacaccaggttgctaactgtatttgtac^tttccct™ 

CTCTGTATTTCGAAAAAA 
FIGURE 72

MVGTKAWVFSFLVLEVTSVLGRQTMLTQSVRRVQPGKKNPSIFAKPM 

DYPGGKGDYERLDAIRFYYGDRVCARPL^ 

RPGQNCSNYTVRFLCPPGSLRRDTERIWSPWSPWSKCSAACGQ^ 

EASEEGQHCMGQDCTACDLTCPMGQVNADCDACMCQDFMLHGAVSLPGGAPASG 

TPKLLTQTDSDGRFRI PGLCPDGKS ILKITKVKFAP I VLTMPKTSLKAATIKAEFVRAETPY 

MVMNPETKARRAGQSVSL€CKATGKPRPDKYFWYHNDTLLDPSLYKHESK^ 

EYFCKAQSDAGAVKSKVAQLIVTASDETPCNPVPESYLIRLPHDCFQNATNSFYYDVGRCPV 

KTCAGQQDNGIRCRDAVQNCCGISKTEEREIQCSGYTLPTKVAKECSCQRCTET^^ 

SAADNGEPMRFGHVYMGNSRVSMTGYKGTFT^ 

KKGSAVFHEI KMLRRKEPITLEAMETO^ 

KASVTFLDPRNISTATAAQTDLNFIND^^ 

DSTQVKMPEHI STVKLWSLNPDTGLWEEEGDFKFE^QRRNKREDRTFLVGNLE I RERRLFNL 
DVPESRRCFVKVRAYRSERFLPSEQIQGWISVINLEPRTGFLSNPRAWGRFDSVITGPNGA 
CVPAFC3)DQSPDAYSAYVLASIiAGEELQAVESSPK^ 

VKKTAFQISMAKPRPNSAEESNGPIYAFENLRACEEAPPSAAHFRFYQIEOTRYDYNTVPFN 
EDDPMSWTEDYLAWWPKPMEFRACYIKVKIVGPLEVNVRSRNMG^ 
TRDRDQPNVSAACLEFKCSGMLYDQDRVDRTLVKVIPQGSCRRASVNPMLH 
NOTTSEYTMLAPLD^ 

FNCVERQVGRQSAFQYLQSTPAQSPAAGTVQGRVPSRRQQRASRGGQRQGGVVASLRFPRVA 
QQPLIN 
FIGURE 73

CTGCAAGTTGTTAACGCCTAACACACAAGTATGTTAGGCTTCCACCAAAGTCCTCAATATAC 

CTGAATACGCACAATATCTTAACTOTCATATTTGGTTTTGGGATCTGCTTTGAGGTCCCAT 

CTTCATTTAAAAAAAAATACAGAGACCTACCTACCCGTACGCATACATACATATGTGTATAT 

ATATGTAAACTAGACAAAGATCGCAGATCATAAAGCAAGCTCTGCTTTAGTTTCCAAGAAGA 

TTACAAAGAATTTAGAGAXaTATTTGTC^GATCCCTGTCGATTCATGCCCTTTGGGTTACG 

GTGTCCTCAGTGATGCAGCCCTACCCTTTGGTTTGGGGACATTATGATTTGTGTAAGACTCA 

GATTTACACGGAAGAAGGGAAAGTTTGGGATTACATGGCCTGCCAGCCGGAATCCACGGACA 

TGACAAAATATCTGAAAGTGAAACTCGATCCTCCGGATATTACCTGTGGAGACCCTCCTGAG 

ACGTTCTGTGCAATGGGCAATCCCTACATGTGCAATAATGAGTGTGATGCGAGTACCCCTGA 

GCTGGCACACCCCCCTGAGCTGATGTTTGATTTTGAAGGAAGACATCCCTCCAC^TTTTGGC 

AGTCTGCCACTTGGAAGGAGTATCCCAAGCCTCTCCAGGTTAACATCACTCrrGTCTTGGAGC 

AAAACCATTGAGCTAACAGACAACATAGTTATTACCTTTGAATCTGGGCGTCCAGACGAAAT 

GATCCTGGAGAAGTCTCTCGATTATGGACGAACATGGCAGCCCTATCAGTATTATGCCACAG 

ACTGCTTAGATGCTTTTCAC^TGGATCCTAAATCCGTGAAGGATTTATCACAGCATACGGTC 

TTAGAAATCATTTGCACAGAAGAGTACTCAACAGGGTATACAACAAATAGCAAAATAATCCA 

CTTTGAAATCAAAGACAGGTTCGCGCTTTTTGCTGGACCTCGCCTACGCAATATGGCTTCCC 

TOTACGGACAGCTGGATACAACC^^GAAACTCAGAGATTTCTTTACAGTCACAGACCTGAGG 

ATAAGGCTGTTAAGACCAGCCGTTGGGGAAATATTTGTAGATGAGCTACACTTGGCACGCTA 

CTTTTACGCGATCTCAGACATAAAGGTGCGAGGAAGGTGCAAGTGTAATCTCCATGCCACTG 

TATGTGTGTATGACAACAGCAAATTGACATGCGAATGTGAGCACAACACTACAGGTCCAGAC 

TGTGGGAAATGCAAGAAGAATTATCAGGGCCGACCTTGGAGTCCAGGCTCCTATCTCCCCAT 

CCCCAAAGGC^CTGCAAATACCTGTATCCCCAGTATTTCCAGTATTGGTACGAATGTCTGCG 

ACAACGAGCTCCTGCACTGCCAGAACGGAGGGACGTGCCACAACAACGTGCGCTGCCTGTGC 

CCGGCCGCATACACGGGCATCCTCTGCGAGAAGCTGCGGTGCGAGGAGGCTGGCAGCTGCGG 

CTCCGACTCTGGCCAGGGCGCGCCCCCGCACGGCACCCCAGCGCTGCTGCTGCTGACCACGC 

TGCTGGGAACCGCCAGCCCCCTGGTGTTCTAGGTGTCACCTCCAGCCACACCGGACGGGCCT 

GTGC CGTGGGGAAG CAGACACAAC CC AAACATTTGCTACTAACATAGGAAACACACACATAC 

AGACACCCCCACTCAGACAGTGTACAAACTAAGAAGGCCTAACTGAACTAAGCCATATTTAT 

CACCCGTGGACAGCACATCCGAGTCAAGACTGTTAATTTCTGACTCCAGAGGAGTTGGCAGC 

TGTTGATATTATCACTGCAAATCACATTGCCAGCTGCAGAGCATATTGTGGATTGGAAAGGC 

TGCGACAGCCCCCCAAACAGGAAAGACAAAAAACAAACAAATCAACCGACCTAAAAACATTG 

GCTACTCTAGCGTGGTGCGCCCTAGTACGACTCCGCCCAGTGTGTGGACCAACCAAATAGCA 

TTCTTTGCTGTCAGGTGCATTGTGGGCATAAGGAAATCTGTTACAAGCTGCCATATTGGCCT 

GCTTCCGTCCCTGAATCCCTTCCAACCTGTGCTTTAGTGAACGTTGCTCTGTAACCCTCGTT 

GGTTGAAAGATTTCTTTGTCTGATGTTAGTGATGCACATGTGTAACAGCCCCCTCTAAAAGC 

GC^GCCAGTCATACCCCTGTATATCTTAGCAGCACTGAGTCCAGTGCGAGCACACACCCAC 

TATACAAGAGTGGCTATAGGAAAAAAGAAAGTGTATCTATCCTTTTGTATTCAAATGAAGTT 

ATTTTTCTTGAACTACTQTAATATGTAGATTTTTTGTATTATTGCCAATTTGTGTT ACCA ^ 

CAATCTGTTAATGTATCTAATTCGAATCAGCAAAGACTGACATTTTATTTTGTCCTCTTTCG 

TTCTGTTTTGTTTCACTGTGCAGAGATTTCTCTGTAAGGGCAACGAACGTGCTGGCATCAAA 

GAATATCAGTTTACATATATAACAAGTGTAATAAGATTCCACCAAAGGACATTCTAAATGTT 

TTCTTGTTGCTTTAACACTGGAAGATTTAAAGAATAAAAACTCCTGCATAAACGATTTCAGG 

AATTTGTATTGCAATTTCTTAAGATGAAAGGAACAG 

ACTGATTTCTGTGTGGACTGAGTACATTCAGCTGACGAATTTAGTTCCCAGGAAGATGGATT 
GATGTTCACTAGCTTGGACAACTTCTGCAAAATATGAGACTATTTCCACTTGGGAAAAATTA 

CAACAGCAAAAAAAAAAAAAAAAAAAAAA 
FIGURE 74

MYLSRSLSIHALWVTVSSVMQPYPLWGHY^ 

VKLDPPDITTCDPPETFCAMGNPYMCNNECDASTPELAHPPEIMFDFEGRHPSTFWQSATWK 
EYPKPLQWITLSWSKTIELTDNIVITPESGRPDQMILEKSLDYGRTWQPYQYYATDCIiDAP 
HMDPKSVKDLSQHTVLEI ICTEEYSTGYTTNSKI IHFE IKDRFALFAGPRLRNMASLYGQLD 

TTKKLRDFFTVTDLRIRLLRPAVGEIFVDELHIJUIYFYAISDIKVRGRCKC^^ 
SKLTCECEHNTTGPDCOKCKKNYQGRPWSPGS YLPI PKGTANTCIPS I SS IGTNVCDNELLH 

C^NGGTangNVRCLCPAAYTGILCEKIACEEAGSCGSDSGQGA^ 
PLVF 
FIGURE 75

CCCACGCGTCCGGGTGACCTGGGCCGAGCCCTCCCGGTCGGCTAAGATTGCTGAGGAGGCGG 
CGGGTAGCTGGCAGGCGCCGACTTCCGAAGGCCGCCGTCCGGGCGAGGTGTCCTGATGACTT 

CTCTTGTGGACC^I2TCQ3TGATCTTTTTTGCCTGCGTGGTACGGGTAAGGGATGG 
CCTCTCAGCCTCTACTGATTTTTACCACACCCAAGATTTTTTGGAATGGAGGAGACG^ 
AGAGTTTAGCCTTGCGACTGGCCCAGTATCCAGGTC0AGGTTCTGCA6AA 
AGTATACATTTTTCTTCTTTCX3GGGACGTGGCCTGCATGGCT 

AGCAGCCATGGCCrTCTGCTTCCTGGAGACCCTGTGGTGGGAATTCACAGCTTCCTATGACA 
CTACCTGCATTGGCCTAGCCTCCAGGCCATACGCTTTTCTTGA 

AAAGTGAAGTGGCATTTTAACTATGTAAGTTCCTCTCAGATGGAGTGCAGCTTGGAAAA 
TCAGGAGGAGCTCAAGTTGCAGCCTC^^ 

ATGGGGTGATGAATGGTCACACACCGATGCACTTGGAGCCTGCTCCTAAT^ 
CCAGTGACAGCCCTGGGTATCCTCTCCCTCATTCTCAACATCATGTGTGCTGCCCTGAATCr 

CATTCGAGGAGTTCACCTTGCAGAAC^TTCTTTACAGGATCCAAGGAGCTGGTTCTGCTGGT 
TGGACCAAACCTCG^GCCAGCCACCCCTGACCCAAATGAGGAGAGCTCTGATTCTCCCAT 
CCGGGAGCAGTGATGTCAAACTTCTGCTGCTGGGGAAATCTCATCAGCAGGGAGCCTGTGGA 
AAAGGGCATGTCAGTGAAATCTGGGAATGGCTGGATTCGGAAACATCTGCCCATGTGTATTG 
ATGGCAGAGCTGTTGCCCACAAGCGCCTTTTATTTAGGGTAAAATTAACAAATCCATTCTAT 
TCCTCTOACCCATGCTTAGTACATATGACCTTTAACCCTTACATTTATATGATTCTGGGGTT 
GCTTCAGAAGTGTTATTTCATGAATCATTCATATGATTTGATCCCCCAGGATTCTATTTTGT 
TTAATGGGCTTTTCTACTAAAAGCATAAAATACTGAGGCTGATTTAGTCAGGGCAAAACCAT 
TTACTTTACATATTCGTTTTCAATACTTGCTGTTCATGTTACACAAGCTT^ 

ttgtaacaataaatattttgagtaaataatcx;gtacattttaacaaactcagtagtacaacc 

TAAACTTGTATAAAAGTGTGTAAAAATGTATAGCCATTTATATCCTATGTATAAATTAAATG 
AGGTGGCTTCAGAAATGGCAGAATAAATCTAAAGTGTTTATTAAAAAAAAAA^ 

AAAAG 




FIGURE 76 

MSVIFFACVVRVRIXSLPLSASTDFYHTQD^ 

SSFGDVACMAI CSCQCPAAMAFCFLETLWWEFTAS YDTTC IGLASRPYAFLEFDS I IQKVKW 

HFNYVSSSQMECSLEKIQEELKLQPPAVLTLEDTD^ 

LGILSLILNIMCAALNLIRGVHLAEH^ 




FIGURE 77

TGCTTCCTGGAGACCCTGTGGTGGGAATTCACAGCTTCNTATGACACTACCTGCATTGGCNT 
AGCCTCCAGGCCATACGCTTITCTTGAGTTTGAC^GCATCATTCAGAAAGTGAAGTGGCATT 
TTAACTATGTAAGTTCCTNTCAGATGGAGTGCAGCTTGGAAAAAATTCAGGAGGAGCTCAAG 
TTGCAGCCTCCAGCGGTTCTCANTATGGAGGACACAGATGTGGCAAATGGGGT 
FIGURE 78

ctcagcggcxscttcctcgtagcgagcctagtggcgggtc^ 

cccgaccitaaagagtggggagcaaagggaggacagagcccttt^ 

cctocccctttaagggcggggcgtc 

tctgtcgcaggctgcgaggaaaggcccct^^ 

cctcccosctcgtcctccccgggccca^^ 

ttcaagcacctgactacgaagtgctatccgtgcgagaacagctattccacgag^ 

gagtgtattatatcaacacttctgtttgcaacactgtacatcctct 

ccgcttcaagaagcctgctgagttcac^ 

ttgosctcgagctgtgc^cccttac^ 

ttctccatcatcagcaatgaggtgctgctctccctgcctcgg^ 

caacggctccctcatccatggcctct<^aacc 

tcttcctcatgccctttgcatatttcttcactgag 

<^tgtcctgggccgggtctatgagac^gtggtgatgtt<3atgct 

aggtatggtgtgggtggc^tcagccattgtggacaag^^ 

atgacttttgggagtactatctcccctacctctactc 

ctcctcctggtgtgtactccactgggtctcgcccgcatgttctcc^ 

agtcaagccccggctgctggaaga<xttggaggagc7vgctgtact 

cagccctgacccgcaggatctctaatcctacttcotgctg^ 

ctacacagacaggtcctggctctgcagac^c^gagggtcctgct 

ttcagcctggcaacggaacctgggctaccccctggctatgctgtgcttgctg^ 

gcctgtctgtgctcattgtggccatccacatc 

ccccgaggcatgcagggtacctccttaggccaggtctccttctccaagctggg 

TGCCGTCATTCAGGTTGTACTCATCTTTTACCTAATGGTGTCCTCA^ 
GCTCTCCACTCTTCCGGAGCCTGCGGCCCAGATGGCACXSACACTG^ 
GGGAACTGTGTCTGTCTCCTGGTCCTAAGCTCAGC^ 
-GCT(^CTCGCCT-TGAGCTGGTGGGTGAGT-T-TG 
TTGTGTTCCTCTACAACGCAGCCTTTGCAG^ 
ACTGCAGCTGTGCXjGGCAGAGCTGATCCG 
CTCCGGTTTCCCCCAGGCATCTAGGAAGACCCAGCACCAGra 

AGGAAAAAACTGGACACTGCCATCTGCTGCCTAGGCCTGGAGGGAAGCCCAAGGCTACTT^ 

ACCTCAGGACCTGGAATCTGAGA<^TGGGTGGCAGAGGGGAGCAGAG 

GCATAATCTGAGCCAGAGTTTGGGACCAGGACCTCCTGCTTTTCCATA 

CAGCATGGGGTAGGGCTGGGTGACTGGGTCTAGCCCCTGATCCCAAATCTGTTTACACAT^ 
ATCTGCCTCACTGCTGTTCTGGGCCATCC^ 

AGGGTGGGGTAGGGGCAGGGAAAGGACTGGGCCAGGGCAGGCTCGGGAGATAG 

CTTGCCTCTGGCCCAGC^GAGCCTAAGCACTGTGCTATCCTGGAGGGGCTTTGGACC^CCTG 

AAAGACCAAGGGGATAGGGAGGAGGAGGCTTCAGCCAT^ 

AAAAAA 



FIGURE 79 

MEAP D YEVLS VREQLFHER I REC 1 1 STLLFATLY I LCH I FLTRF KKPAEFTTVDDEDATVNK 
IALELCTFTLAI ALGAVLLLPFS I ISNEVLLSLPRNYYIQWLNGSLIHGLWNLVFLFPNLSL 
IFLMPFAYFFTESEGFA6SRKGVLGRVYETVVMLMLLTLLVLGMVWVASAIVDKNKANRBSL 
YDFWEYYLPYLYSCISFLGVLLLLVCTPLGLARMFSVTGKLLVKPRLLEDLEEQLYCSAFEE 

AALTRRICNPTSC^PLDMELLHRQVIALQTQRVLLEKRRKASAWO^ 

GLSVLIVAIHILELLIDEAAMPRGMQGTSLGQVSFSKIX3SFGAVIQVVLIFYLMVSSVVGFY 
SSPLFRSLRPRWHDTAMTQIIGNCVCLLVLSSALPVFSRTLGLTRFDLU3DFGRFNWLGNFY 
IVFLYNAAFAGLTTLCLVKTFTAAVRAELIRAFGLDRLPLPVSGFPQASRKTQHQ 




FIGURE 80 

GGCTGCCGAGGGAAGGCCCCTTGGGT^ 

GCTCGTCCTCCCCGGGCCCAGAGGCACCTC 

ACCTGACTACGAAGTGCTATCCGTGCGAGAAC^ 

TTATATCAACACTTCTGTTTGCAACACTGT^ 

AAGAAGCCTGCTGAGTTCACCACAGTGGATGATGAAGATGCCACCG 




FIGURE 81 

GACCGACCTTAAAGAGTGGGAGCAAAGGGAGGACAGAGCCTTTTAAAACGAGGCGGTGGTGC 

CTGCCCTTTAAGGGCGGGGCGTCCGGACGACTGTATCTGAGCCCCAGACTGCCCCGAGTTTC 

TGTCGCAGGCTGCGAGGAAAGGCCCCTAGGCTGGGTCTGGTGCTTGGCGGCGGCGGOT 

CCCCGTTGTCNTCCCCGGGCCCAGAGGCACCTCGGCTTCAGTCATGCTGAGCAGAGTATGGA 

AGCACCTGACTACGAAGTGCTATCCGTGCX3AGAACAGCTATTCCACGAGAGGATCCGCGAGT 

GTATTATATCAACACTTCTGTTTGCAACACTGTACATCNTCTGCCACATCTTCCTGACCCGC 

TTCAAGAAGCCTGCTGAGTTCACCACAGTGGATGATGAAGATGCCACCGTCAACAAGATTGC 

GCTCGAGCTGTGCACCTTTACCCTGGCAATTGCCCTGGGTGCTGTCCTGCTCCTGCCCTTCT 

CCATCATCAGCAATGAGGTGCTGCACTCCC 



FIGURE 82 

GATGTGCTCCITGGAGCTGGTGTGCAGTGTCCTGACT 

GGAATTGAGGAAACTTCTCTTTTG 

GGGTGATATTACTGGTCCTGGCTCCTGTCAGT^ 

ATTTTCCTCCAGCCTCCATGGACCACAGTC 

GGGATTTCGCTTCTACTCACCACAGAA^ 

TACTAAGAGAAACCCCAGACAATATCCTTGAGGTT^ 

GCCCAGGGCTCCCCTCTCAGTAGCCCTG^ 

TCATGCTGCCCAGGCTAATGTTGAACTCCT 

AAAGCGCTGGGATTACAGCTTCGCTGATCCTGCAAGCTCCACTTTCTGTGTTTGAAGGAGAC 

TCTGTGGTTCTG^GGTGCCGGGCAAAGGCGGAAGTAACACTGAATAATACTATOT 

TGATAATGTCCTGGCATTCCTTAATAAAAGAACTGACT^ 




FIGURE 83

^LWVIULVIAPVSGQFARTPRPIIFI^PPVrTTVFQGERVTLTCKGFRFYSPQKTKWYHRYL 
GKEILRETPDNIIjEVQESGEYRCQAQGSPLSSPVHLDFSSEMGFPHAAQANVELI/SSSDLLT 
FIGURE 84

CAGAAGAGGGGGCTAGCTAGCTGTCTCTGCG^ 

GAGGCGGCCTCACAGGGCCGGGTGGGCTGGCGAGCCGACGC^^ 

GAGTGTGTGGAACAGGAC^GGGACAGAGGAACCA 

GCCTGTTGCTGCTATACCTCATC3GGGCGGTGATTGCCGGACGA 

GGGGTGCCTCGAAGTGCCTCTATAAAGGATATTAAAAAGGCCTATAGGAAACT 

GCTTCATCCCGACCGGAACCCTGATGATCt^ 

CTGCTTATGAGGTTCTGTCAGATAGTGAGAAACGGAAA^ 

GGATTAAAAGATGGTGATCAGAGCTCCCATGGAGAC^ 

TGGTTTCATGTTTGGAGGAACCCCTCGTCAGC^ 

TTATTGTAGATCTAGAAGTCACTTTGGAAGAAGTATATC 

AGAAACAAACCTGTGGCAAGGCAGGCTCXITGGCAAAC^ 

GCGGACCACCCAGCTGGGCCCTGGGGGCTTCCAAATGACCCAGGAGGTGGTC 

GCCCTAATGTCAAACTAGTGAATGAAGAACGAACGCTO 

AGAGACGGC^TGGAGTACCCCTTTATTGGAGAAGGTGAG 

AGATTTACGGTTCCGAATCAAAGTTGTCAAGCACCCAATATTTGAAAC^ 

TGTACACAAATGTGACAATCTCATTAGTTGAGTCACTGGTTGGCTTTGAGATGGATATT 

CACTTGGATGGTCACAAGGTACATATTTCCCGGGATAAGATC^ 

ATGGAAGAAAGGGGAAGGGCTCCCCAACTTTGACAACAACAATATCAAGGGCT 

TCACTTTTGATGTGGATTTTCGkAAAGAACAGTC 

CAGCTACTGAAACAAGGGTCAGTGCAGAAGGTATACAATG^ 

AATAAAATTGGACTTTGTTTAAAATAAGTGAATAAGCGATATTTATTATCTGCAAGGTl^ 
TTGTGTGTGTTTTTGTTTTTATrTTCAATATGCAAGTTAGGCTTAATTTTOT 
TCATCATGAAATGAATAAGAGGGCTTAAGAATTT 
AGCAAAAGGTTTACTAATACCTCTCCCTTT^ 

TTCAAGAATTAAAGCTGCAAGAGGACTCCAGGAGCAAAAGAAACAGA^ 

GTTGTTAGCAATTTCATTCAAAATGCCAACTGGAGAAGTCTGTTTTTAAATA 

TTATTTTTA 




FIGURE 85

MAPQNLSTFCLLLLYLIGAVIAGRDPYKILGVPRSAS I KDIKKAYRKLALQLHPDRNPDDPQ 
AQEKFQDLGAAYEVLSDSEKRKQYDTYGEEGLKDGHQSSHGDIFSHPFGDFGFMFGGTPRQQ 
DRNIPRGSDIIVDLEVTLEEVYAGNFVEVVRNKPVARQAPGKRKCNaRQEMRTTQLGPGRFQ 
MTQEWCDECPNVKLVNEERTLEVEIEPGVRDGMEYPFIGEGEPHVTGEPGDLRFRIKVVKH 
PI FERRGDDLYTNVTI SLVESLVGFEMD ITHLDGHKVHISRDKITRPGAKLWKKGEGLPNFD 
NNNIKGSLI ITFDVDFPKEQLTEEAREGIKQLLKQGSVQKVYNGLQGY 
FIGURE 86

TGGGACCAGGGAACCCCGGGCCCCCCGGTGGAGNGCCTAACAGGCCGGTGGNTGCGACCG^ 
GOTCCGGGCX3GAGGAGGTTTTGAGGATT 

CCGCAGAACNTGAGCACNTTTTGCCTGTTGNTGNTATACTTC^ 
ACGAGATTTNTATAAGATTTTGGGGTGCCTNGAAGTGCCT 
CCTATAGGAAACTAGCCCTGCAGNTTTATCCX!^ 
GAGAAATTCCAGGATTTGGGTGCTGCTTATGAGGTTOT 

GTACGATAATTATGGTGAAGAAGGATTAAAAGATGGTNATCAGAGCTCCCATGGA 

TTTCACACTTNTTTGGGGATTTTGGTTTCATG 

AATATTCCAAGAG 




FIGURE 87

GGCACGAGGCGGCGGGGCAGTCGCGGGATGCGCCCGGGAGCCACAGCCTGAGGCCCTCAGGT 

CTCTGC^GGTGTCGTGGAGGAACCrrAGCACCTGCCATCCrcrTCCCCAATT^ 

GCAGCTTTAGCCCATGAGQAGGATGTGACCGGGACTGAGTCAGGAGCCCTCTGGAAGCKCSG 

AC^CTGTGGTGATTGTTGCCATAGGTGTGCTGGCCACCATCTTTCTGGCTTCGTTTGCAGCC 

TTGGTGCTGGTTTGCAGGCAGCGCTACTGCCGGCCGCGAGACCTGCTGCAGCGCTATG^ 

TAAGCCCATTGTGGACCTCATTGGTGCCATGGAGACCCAGTCTGAGCCCTC'rGAGTTAGAAC 

TGGACGATGTCGTTATCACCAACCCCCACATTGAGGCCATTCTGGAGAATGAAGACTGGATC 

GAAGATGCCTCGGGTCTCATGTCCCACTGCATTGCCATCTTGAAGATTTGTCAC^ 

AGAGAAGCTTGTTGCCATGACAATGGGCTCTGGGGCCAAGATGAAGACTTCAGCCAGTGTCA 

GCGACATCATTGTGGTGGCCAAGCGGATCAGCCCCAGGGTGGATGATGTTGTGAAGTCGATG 

TACCCTCCGTTGGACCCCAAACTCCTGGACGCACGGACGACTGCCCTGCTCCTGTCTGTCAG 

TCACCTGGTGCTGGTGACAAGGAATGCCTGCCATCTGACGGGAGGCCTGGACTGGATTGACC 

AGTCTCTGTCGGCTGCTGAGGAGCATTTGGAAGTCCTTCGAGAAGCAGCCCTAGCTTCTGAG 

CCAGATAAACXTCCTCCCAGGCCCTGAATO 

ACAGGCCAGCAGCTAGCCATGAAGGCCCCTGCCGCCATCCCTGGATGGCTCAGCTTAGCCTT 
CTACTTTTTCCTATAGAGTTAGTTGTTCTCCATOGCTGGA(^GTTCAGCTGTGTGTGCATAG 
TAAAGCAGGAGATCCCCGTCAGTTTATGCCTCTTTTGCAGTTGCAAACTGTGGCTGGTGAGT 
GGCAGTCTAATACTACAGTTAGGGGAGATGCCATTCACTCTCTGCAAGAGGAGTATTGAAAA 
CTGGTGGACTGTCAGCTTTATTTAGCTCACCTAGTGTTTTC^^GAAAATTGAGCCACCX^CT 
AAGAAATCAAGAGGTTTCACATTAAAATTAGAATTTCTGGCCTCTCTCGATCGGTCAGAATG 

TGTGGOATTCTGATCTGCATTTTCAGAAGAGGAC^^ 

TTCTTTTGGCAAGACTTGTACTCTCrrCACCTGGCCTGTTTCATTTATTTGTATTATCT 
GGTCCCrGAGGCGTCTGGGTCTCTCCTCTCCCTTGCAGGTTTGGGTTTGAAGCTGAGGAACT 
ACAAAGTTGATGATTTCTTTTTTATCTTTATGCCTGCAATTTTACCTAGCTACCACTAGGT^ 
GATAGTAAATTTATACTTATGTTTCCCTCAAAAAAAAAAAAAAA 
FIGURE 88

METVVIVAIGVLATIFLASFAALVLVCRQRYCRPRDLLQRYDSKPIVDLIGAMETQSEPSEL 
ELDDWITNPHIEAILENEDWIEDAS6LMSHCIAILKICHTLTEKLVAMTM6S6AKMKTSAS 
VSDIIWAKRISPRVDDVVKSMYPPLDPKLLDARTTALLLSVSHLVLVTRNACHLTGGLDWI 
DQSLSAAEEHLEVLREAALASEPDKGLPGPEGFLQEQSAI 



FIGURE 89

QCTTCATTTCTCCCX^CTCAGCTTCCCACCCTGGGCTTTCCGAGGTGCTTTCGCOGCTGTCC 

rr , arrafn , nr&c;CCATQATCTCCTTAACGGACACGCAGAAAATTGGAATGGGATTAAC^G^ 

TTTGGAGTGTTTTTCCTGTTCTTTGGAATGATTCTCTTTTTTGACAAAGCACTAC 

TGGAAATGTTTTATTTGTAGCCGGCTTGGCTTTTGTAATTGGTTTAGAAAGAACATTCAGAT 

TCTTCTTCCAAAAACATAAAATGAAAGCTACAGGTTTTTTTCTGGGTGGTGTATTTGTAGTC 

CTTATTGGTTGGCCTTTGATAGGCATGATCITCGAAATTTATGGATTTTTTCT 

GGGCTTCTTTCCTGTO^TGTTGGCriTTATTAGAAGAGTC 

TTACCTGGAATTAGATCATTTGTAGATAAAGTTGGAGAAAGCAACAATATGGTATAACAACA 
AGTGAATTTGAAGACTCATTTAAAATATTGTGTTATTTATAAAGTCATTTGAAGAATATTCA 
GGACAAAATTAAATTACATGAAATAGCTTGTAATGTTCTTTAGAG 

CCTAa^GTACCAGCAGCAAATTAGCAAAGAAGCAGTGAAAACAGGCTTCTACTCAAGTGA 
ACTAAGAAGAAGTCAGC^^GC^AACTGAGAGAGGTGAAATCCATGTTAATGATGCTTAAGAA 
ACTCTTGAAGGCTATTTGTGTTGTTTTTCCACAATGTGCGAAACTaVGCCATCCTTAGAGAA 
CTGTGGTGCCTGUTrClU^^'CTTTTTATTTTGAAGGCTCAGGAGCATCCATAGGCATTTGCT 
TTTTAGAAGTGTCCACTGCAATGGCAAAAATATTTC 

TGCATGAATTCGATTGGATTGTGTCATTTTAAAGTATTAAAACCAAGGAAACCCCAATTTTG 
ATGTATGGATTACTTTTTTTTGNGCNCAGGGCC 
FIGURE 90

MISLTDTQKIGMGLTGFGVFFLFFGMILFFDKALIAIG 

HKMKATGFFLGGVFWLIGTOLIGMIFEIYGF 

RSFVDKVGESNNMV 

Important features:

Transmembrane domains:
amino acids 12-30 (type II), 33-52, 69-89 and 93-109

N-myristoylation sites:
amino acids 11-16, 51-56 and 116-121

Aminoacyl-transfer RNA synthetases class-II protein:
amino acids 49-59
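The residue ranges in the feature list above are 1-based and inclusive. As a minimal sketch (Python is used purely for illustration; the helper names `parse_ranges` and `slice_residues` are hypothetical and not part of the original disclosure), such ranges can be parsed from the annotation text and mapped onto a sequence string. The input here is only the first printed line of the Figure 90 polypeptide, so ranges extending past residue 38 cannot be extracted from it.

```python
import re

# First printed line of the Figure 90 polypeptide (38 residues only);
# the remainder of the figure is not reproduced in this sketch.
FIG90_PREFIX = "MISLTDTQKIGMGLTGFGVFFLFFGMILFFDKALIAIG"


def parse_ranges(text):
    """Return (start, end) integer pairs for every 'N-M' range in the text."""
    return [(int(a), int(b)) for a, b in re.findall(r"(\d+)-(\d+)", text)]


def slice_residues(seq, start, end):
    """Return residues start..end (1-based, inclusive), or None if out of range."""
    if start < 1 or end > len(seq) or start > end:
        return None
    return seq[start - 1:end]


# Ranges copied from the "Transmembrane domains" entry above.
tm_ranges = parse_ranges("amino acids 12-30 (type II), 33-52, 69-89 and 93-109")

# Only the first range (12-30) lies entirely inside the 38-residue prefix.
first_tm = slice_residues(FIG90_PREFIX, *tm_ranges[0])
```

Returning `None` for a range that overruns the available sequence, rather than a truncated slice, is a design choice that keeps incomplete input from silently yielding a shortened domain.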



FIGURE 91

GAAGACGTGGCGGCTCTCGCCH^CTGTTTCCCGGCTTC^TTTCTCCCGACTC^GCTTCCC 
ACCNTGGGCTTTCCGAGGTGCTTTCGCCGCTGTCCCGACCACTGCAGCCATGATCrCCTTAA 

CGGA^CGCAGAAAATTGGAATGGGATTAAC^^ 

ATGATTCTCTTTTTTGACAAAGCACTACTGGCrATTGGAAATGTTTTATTTG 
GGCTTTTGTAATTGGTTTAGAAAGAACATTCAGATTCTTCTTCCAAAAACATAAAATGAAAG 

CTACAGGTTTTTTTCTGGGTGGTGTATTTGTAGTCCTTATTGK3TTGGCCTTTGATA 

ATCTTCGAAATTTATGGATTTTTTCTCTTGTTC 

FIGURE 92 



GG C^CGAGGCTGAACCCAGCCGGCTCCATCTCAG 

GGCTGCCAGGAAGGAGACGCCTTCCTGAGTC^ 

CTGTGGGTAGTTATTTATTTCT^ 

TGAAGTCTCAGTTCCTGTGCCACCTGGTCTTCTGCTACGTCTTTATTO 
ATCAACACCATTCAGCTCTTCACTCTCCTCCT 

GATCAACTGCAGACTGTCCTATTGCATCTCAAGCCAGCTGGTGATGCT 

CGGGCACGGAATGCACCATCTTCACGGACCCGCGCGCCTACCTCAACT 

GCCATCGTGGTTCTCAACCACAAGTTTGAAATTGACTTTCTGTGTC 

ACGCTTTGGGCTGTTAGKSGGGCTCCAAGGTC^ 

TTATCGGCTGGATGTGGTACTTCACCGAGATGK5TC^ 

CGCAAGACGGTTGCCACCAGTTTGCAGCACCTCCGGGACTAC^ 

GATTCACTGTGAGGGCACACGGTTCACGGAGAAGAAGCATGAGATC^ 

GGGCCAAGGGGCTGCCTCGCCTCAAGCATCACCTGTTC 

ACCGTGAGGAGCTTGAGAAATGTAGTTTCAGCTGTATAT^ 

TAATGAAAATCCAACACTGCTGGGAGTCCTAAA 

TTAGGAGGATCCCACTGGAAGACATC<X:TGAAGACGATGACGAGTGCT 
AAGCTCTACCAGGAGAAGGAT<X:CTTT(^GGAGGAGTACT 



GACGCCCATGGTGCCCCCCCGGCGGCCCTGGACCCTCGTGAACTGGCTC 

TGGTGCTCTACCCTTTCTTCCAGTTCCTGGTCAGC^ 

CTGGCCAGCTT^TCCTCGTCTTC 

GACGGAAATTGACAAGGGCTCTGCOTACGGCAACTCTGAC^ 
GACT<^GGGAGGTGTCACCATCCGAA^ 

CCTTAGTGGGACACGGTGACAAAGGCTGGGTGAGCCCCTGCT^ 

CCTCTCCAGCCAGGGAGTCTGGTCTCAAGGCCGGATGGGGAGGAAGATGTT^ 

TTTTCCCCATGTGCTTTA<3T<3GGCTTTGGTTTTOT 

TGTGTCGTGAGTGTGAACTTTGTTCTGTGATCATAGAAAGGGTATTTTA<K3 
GGCAGGGCTGGGGACCGAAGGGGACAAGTTC^ 
AACCCTTGGTTGCCAGAGATAAAGTGAAAAGTGC^ 
CAAGAAAAAAAAATTAAAGTGCTTTTCTG^^ 



FIGURE 93

MDLAGLLKSQFLCHLVFCYVF IASGLI INTIQLFTLLLWP INKQLFRKINCRLSYC I SSQLV 
Mii^SGTECTIFTDPRAYLKYG^ 

EIAYVPIIGV^FTEMVFCSRKWEQDRIOVATSI^HLRDYPEKYFFLIHCEGTRFTEKI^ 
IS MQVARAKGLPRLKHHLLPRT^ 

YHADLYVRRI PLEDI PEDDDECSAWLHKLYQEKDAFQEEYYRTGTFPETPMVPPRRPWTLVN 
WLFWASLVLYPFFQFLVSMIRSGSSLTI^FILVFFVASVGVRWMIGVTEIDKGSAYGNSDS 

KQKLND 
FIGURE 94

CTGAGGCGGCGGTAGCATGGAGGG 
GCGCACTCGCTTTCCAGCACCTCAACACW 

GTAAAAGGTGAAGCCAAGAACAGCATTACTGATTCCCAAATCGATC 

TACAATTGACATTCAGAAATATATTCCATGCT 

GCGAAGTAAATGAGCAA<3CACTGAAGAAAATAT^ 

TGGTACAAATTCCGTCGTCATTCAGATCA<^ 

AAACTTGCAGGAGCATTTTTCAAACCAAGAC^ 

TAACAGAAAGCTGCTCTACTCATC<^CTGGAACATTCCTTATATAA 

TTTCACAGGGTACCTTTAGTGGTTGCC^ 

TGTATGAGGTTCCTGTATGTCCACTGGTm 

TTTTTGAAGAAGATGGATCCTTAAAGGAGGTACATAAGATAAATGAAATGTATGCTT 

CSVAGAGGAATTAAAGAGTATATGCAAAAAAGTGGAA^^ 

AGTAAAGGATGTAAACAGATTAAAACGAGAAATT<3^ 

CAGCAAGAGAGAAGAACATCCAAAAAGACCCTCAGGAGAACATTTT 

CGGACCTTTTTTCCAAATTCTGAATTTCTTCATTCATGTGTTATGT 

"TGTTTCTAAAAGTAGCTGTAACTACAACCA^ 

TGGTAGAACAC^CTGACATTCCTGAAGC 

AAAGCCTTAGACTTAGATGACAGATGGCAATTCAAGAGATCTCX^TTGTTAGATACA 
CAAACXSATCTAAAGCAAATACTGGTAGTAGTAACC^ 
CAGAAACAGATGAAGAAATTGAAAAGATGAAGGGTTTTGGT^ 
TTT TGA TCCTTTTAACCTTACAAGGAGATTTT^ 

TTCTATTGTTTTTACTATGTTGAGCTACTTGCAGTAAGTTCATTTGTTTT 
CTGTTTGCAGTAATACACAGATAACTCTTAGTGC^ 

TACACAGACACATTCTTTAGAATTGGAAAAGTGAGACCAGGC^ 

TCCCAGCACTTAGGGAAGACAAGTCAGGAGGATTC 

TGGGCAAOnATTGAGACCATCTCTATT^ 

TTTCAAAATATGGAAAGAAATTTATATGAAAATTTATCTGAGTCATTAAAATTCT 

TGATACTTTTTTAGAA<3TACATTATGGCTAGAGTTGCCAGATAAAATGCT 

ATAAATTTGCAAAACATCATCTAAAATTTAAAAAAAAAAAAAA^^ 



FIGURE 95

MEGESTSAVLSGFVLGALAFQHLlTroSDTEGFL 
KyiPCYQLFSFYNSSGEVNEQALKKILSNVKKWVVG 

FSNQDLVFLLLTPSIITESCSTHRLEHSLYKPQKGLFHRVPLWANLGMSEQLGYKTVSGSC 

MSTGFSRAVQTHSSKFFEEDGSLKEVHKINEMYASLQEELKSICKKVEDSEQAVDKLVKDVN 

RLKREIEIGIRGAQIQAAREKNIQKDPQENIFLCQALRTFFPNSEFLHSCVMSLKNRHVSKSS 

CNYNHHLDWDNLTLMVEHTDIPEASPASTPQIIKHKALDLDDRWQFKRSRLIJ^ 

NTGSSNQDKASKMSSPETDEEIEKMKGFGEYSRSPTF 
FIGURE 96

GGCACAGCC<3CGCGGC(^GGGCA^ 

GCGCAGGGCAGCCCAAGCAGCGCGC^GCGAACGCCC^ 

CCGCGGCGCCTCCCACCCTTCCCrCCTTCCCCGCGTCCCCGCCTCGC 

CCGGGTTCGCTGCCCCGCGAAACCGCG^GGTCAC 

CGCX5CCGCCTCCACGCCCTCCTTCTCCC 

GACGCGAGGCCCAGCTCTACTTTTCGCCCCGCGTCTCCTCCGCCTGCT 

ACTCCAACTCCTTCTCCCTCCAGCTCC^^ 

GCTGCOSTAGCGCCGCTTCCCGTCC^ 

CCATCGCACGGTT^ 

CTGGCTGCCGAGCTCAAGTCGAAAAGTTGCTCGGAAGTGCGACGTCTTTACGTC 
CTTCAACAAGAACGATGCCCCCCTCCACGAGATCAACGGTGATC^ 
AGGGTTCTACCTGCTGCTCTCAAGAGATGGAGGAGAAGTAC^^ 
TTCAAAAGTGTGGTCAGCGAACAGTGCAATCATTTC 

gaagtttgatgaattcttcaaagaactacttgaaaa 
ttgtgaagacatatggccatttatacatgcaa 

gagttgaaacgttactacgtggtgggaaatgtgaacctggaagaaatgctaaatgact 

ggcix^cctcctggagcggatgttccgcctggtgaact 

atctggaatgtgtgagcaagtatacggagcagcto 

ttgaagctccaggttactcgtgcttttgtagcagcccgtactttcgct 

tgcgggagatgtcgtgagcaaggtctccgtggtaaac^ 

tgttgaagatgatctact<k:tccx^^ 

TACTGCTCAAACATCATGAGAGGCTGTTTGGC^ 
CAATTTCATAGATGCTTVTGCTC^ 

C<^TCATGGATCCCATCGATGTGAAGATTTCTGATGCTATTATGAACATGCAGGATAATAGT 

GTTCAAGTGTCTCAGAAGGTTTTCCAGGGATGTGGACCCCCCAAGCCCCT 

AATTTCTCCT 

AACGCCCAACCACAGCAGCTGGCACTAGTTTGGACCGA 
CTGAAACAGGCCAAGAAATTCTGGTCCTCCCTTC^ 
GGCTGCAGGAAACGGGAATGAGGATGACTGTTGGAATG^ 
TTGCAGTGACAGGAAATGGATTAGCCAACCAG 

AGCAAACCAGACATACTGATCCTTCGTCAAATCATGGCTCTTCGAGTC 

GAAGAATGCATACAATGGGAACGACGTGGACTTCTC 

AAGGAAGTGGAAGTGGCTGTGAGTATCAGCAGTGCCCT 

GACCATGCTGGGAAGAGTGCCAATGAGAAAGCCGACAGTGCTGG 

GGCCTACCTCCTCACTGTCTTCTGCATCTTGTTCCTGGTTATGCAG 

TCTCAAACTCTGAGAAAAAGTGTTCATCAAAAAGTTAAAAGGCA^ 

CCATCCTAGTGACTTTGCTTTTTAAATG 

CACTGGTTTAAGAAGTGCTGACTTTGTTTTCT 

CATTGAGTTGGTTCOTGCTCCCCCAAACCATGTTAAACGTGGCTAACAGTC 
CTATAGTTAGTTGTGCATTTGTGATTTTAT 
ATTTCGTTTGTGGGTTTTTTTTTCCAACTGTGATCTCGCCTTO 
GGTCCCTTCTTGGCACGTAACATGTACGTATTTCTGAAATATTAAATAGCTC 
GGTTTTATTTATCATGTTATCTTATTAAAAGAAAAAGCCCAAAAAGC m * 
FIGURE 97

MARFGL P ALLCTIAVLS AALLAAELKS KS CSE VRRL YVS KGFNKND APLHE I NGDHLK I CPQ 

GSTCCSQEMEEKYSLQSKDDFKSVVSEQCNHLQAVFASRYKKFDEFFKELLENAEKSLNDMF 

VKTYGHLYMQNSELFKDLFVELKRYYWGNVNLEEMLNDFWARLLERMFRLVNSQYHFTDEY 

LEC^SKYTEQLKPFGDVPRKLKMVTRAFVAARTFAO^IAVAGDWSKVSVVNPTAQCTHAL 

LKMIYCSHCIRGLVTVKPCYNYCSNIMRGCIiANQGDIjDFEWNNFIDAMLMVAERLEGPFNIES 

VMDPIDVKISDAIMNMQDNSVQVSQKVFQGCGPPKPLPAGRISRSISESAFSARFRPHHPEE 

RPTTAAGTSLDRLVTDVKEIOjKQAKKFWSSLPSNVCNDERMAAGNGNEDDCTWGKGKSRYL 

AVTGNGLANQGNNPEVQVDTSKPDILILRQIMALRVMTSKMKNAYNGNDVDFFDISDESSGE 

GSGSGCEYQQCPSEFDYNATDHAGKSANEKADSAGVRPGAQAYLLTVFCILFLVMQREWR 



FIGURE 98

CTCGCCCTCAAATGGGAACGCTG^ 

GACCTGAGTCATCCCCAGGGATCAGGAGCCTCCAGCAGGGAACCTT 

GCAACTTAC71GCTGCACCGACAGTTGCGATGAAAGTTOTAA 

TGCCACTAATGCTGATGTCCATGGTCTC 

AGGGACCGAGGCCAGGCTTCTAGGAGATGGCTCC&^ 

AGATTGGTTCCTGAGAGCCCCGAGAAGAAAAT^ 

AGTGCCCCTGTGATCATTTCAAGGGCAATGTGAAGAAAACAAGACAC 

AAGCCAAACAAGCATTCCAGAGCCTGCCAGCAATTTCTC^^ 

TGCTCTGCCTTTG2&GGAGCTCTGAGCGCCCACTCTTCCAATT 

GACAGTGAGCACACCTACCAGACACTCTTC 

CCTAAATCATTCCAGTGCTCTCAAAAAGCATGTTTTTCAAG^ 

TCTAGTGTCTTCTTCTCTCGTCAGTC^ 

AATTACCTGAAAGATTCCAGGAAACIK3TAG 

AATCAGGAAAGTAGCAAACAGAAGTCAATAAATATTTCT 



FIGURE 99

MKVLISSLLLLLPLMLMSWSSSLNPGVARGHRDRGQASRRWLQEGGQ 
KFMTVSGLPKKQCPCDHFKGNVKKTRHQRHHRKPNKHSRACQQFLKQCQLRSFALPL 




FIGURE 100 

AATGGCTGTCTTAGTACTTCGCCrGACAGTTGTCCTGGGACTG 

GCTATGCAGACGACAAACCAGACAAGCCAGACGACAAGCCA 

AAGCCAGACTTCCCCAAATTCCTAAGCCTCCT 

GTTCATCCTCCGCTCGATGTCGAGGAGCA 

AACATTCATCAAA GTGA CATCCTCAG^ 

GCCAAATCCTGCTTTTCCAGTTTGGCT 

TCCCAACGAGTTCTCAGGATTCAGGCTCT^ 

CTGACTGCATTTTTGCTTTTAGAAAGT^ 

ATGGAGAGGAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAA 



FIGURE 101

MAVLVLRLTVVI/SLLVLPLTCbTADDKPDKPDDKPDDSGKDPKPDFPKFLSLLGTEIIENA^ 
FILRSMSRSTGFMEFDDNEGKHSSK 




FIGURE 102 

(^ACGCCAGCGCCTGCAGAGGCTGAGCAGGGAAAAAGCCAGTC 

CAGAGCTGGTCTGCCATGGACATCCrGGTCCCACTCCTGCA 

CCTGCCCCTGCACCT<^TGGCTCTGCTGGGCroCTGGCAG 

CCTACCTGATGGCCGTGCTGACTCCCAAGAGCAACC^ 

CTCTTCAGCCAGATAAAGGGGCTTACAGGAGC 

CTGCGGAACCGGAGCCAACTTTCAGTTCTACCCACCGGGCTGC^ 

CAAATCCCCACTTTGAGAAGTTCCTGACAAAGAGCATGGCTC^ 

GAGCGGTTTGTGGTGGCTCCTGGAGAGGACATGAGACAGCTGGCTG^ 

GGTGGTCTGCACTCTGGTGCTGTGCTCTGTGC^^ 

GGAGAGTACTGAGACCGGGAGGTGTGCTCTTTTTC 

AGCTGGGCCTTCATGTGGCAGCAAGTTTTCX3AGCCCACCT 

CTGCCTGACCAGAGAGACCTGGAAGGATCTT(^ 

AACGACAGCXICCCTCCCTTGAAGTGGCTACCroTTCGG 

AAACAATCTTTCCCAAGCTCCAAGGCACTCATTTGCTCCT 

AGCCACCCACCAGCCTATCTATCTTCCACTGAGAGGG 

CATGTACC^CCTACTAGTCCCTCTCTCCC^ 

CCXSCCTTCGACA GTGAAAAAGCTCTACTTCTAGGCTGACC CAG GGAGGAAACACTAGGACC ^ 

TGTTGTATCCTCAACTGCAAGTTTCTGGACT 

CCTTTCCTTCCTTCCCATGGTAAAGCT^ 

CTCTAGGAACTGGTCACAAAAGTCATGGTGC^ 

CTCCCCACTACCACCTTCTTCCTGAGCTGGGGGCACCAGGGAGAATCAGAGATG 

GCGAGAGCAAGACTCAAAGAGGCAGAGGTTTTGTTCTCAAATAT 

AACCACG 



FIGURE 103

MD I LVPIXQLLVLLLTLPLHLMALLGCWQPLCKSYF P YLMAVLT PKSNRKMESKKRELFSQ I 
KGLTGASGKVALLE1X3CGTGANFQFYPPGCRVTCIJDPNPHFEKFLTKSMAENRHLQYERFVV 
APGEDMRQLADGSMDVWCTLVLCSVQSPRKVLQEVRRVIJIPGGVLFFWEHVAEPYGSWAFM 
WQQVFEPTWKHIGDGCCLTRETWKDLENAQFSEIQMERQPPPLKWLPVGPHIMGKAVKQSPP 
SSKALICSFPSLQLEQATHQPIYLPLRGT 



FIGURE 104



GTGG<^TTTATTTGAGTGCAAGATCGTT 

CAGATGTTGGGGCTTTGTCCGAACAGCTCCCCTC 

ACTAATATTTATATGACAGAAGAAAAAGA^TC^^ 

TCCTGGCTGTTGCTCTCTTCITACTGGTTTTG 

TTAAGGAATGAGGTTACAGATTCAGGAATTGTAGGGCCTC^ 

TGCTCTCCGACAT<SCAGTAGATGGGAGACAAGAGGAGAT^ 

AAGACAGGOTTGGGGGGGCCATTGCAGCTATAAACAGCATTCA 

GTGATTTTCTACATTGTTACTCTCAACAATACAGC^ 

TGATTCCCTGAAAAGCATCAGATACAAAATTGTCAATTTTGA 

AAGTAAAGGAGGATCCTGACCAGGGGGAATCC^ 

TTGCCAATTCTGGTTCCC^ 

AGGTGATATTCTTGCCCTTTACAATACAGCAC^ 

AAGATTGTGATTCAGCCTCTTVCTAAAGT^ 

ATTGGCTATCTTGACTATAAAAAGGAAAGAATTCGTAAGCTTTCCATGA 
CTCATTTAATCCTGGAGTTTTTGTTGCAAAC^ 
ACCAACTGGAAAAATGGATGAAACTCAATGTAGAAGA 
GGTAGCATCTK^CACCTCCTCTGCT 

TATGTGGAATGTCC^CACCTTGGTTCCAGTGCTGGAAAACGATATO 

AGGCTGCCAAGTTACTCCATTGtfl^TGGAC^ 

ACTGATGTTTGGGAAAAATGGTATATTCCAGACCCAAC^ 

ATATACCGAGATCTCAAACATAAAGTG^ 

GAAGTCCTGGAAGATAGCATGCATGGGAAGTAA^ 

GCAAGCCATGGAAAAAGATGTGTCAGCTAGGTAAAGATGACAAACTGCCCTC 
AGCTTCCC^ACAGACTATAGACTATAAAT^^ 

ACTACAATGCTGAATGACTGGAAAGAAGAACTGATATGGCTAGTTCAGCT 

TAATTC&AAACTGCTGTTGGT^^ 

ACATTTTTC 



FIGURE 105

MSFRKVNIIILVIAVMiFIiVLHHNFLSLSSLL^ 

QEEIPWIAASEDRLGGAIAAINSIQHNTRSNVIFYIVTIjlWTADHLRSVnjNSDSL'KSIRYK 
IVNFDPKLLEGKVKEDPDQGESMKPLTFARFYLPILVPSAKKAIYMDDDVIVQGDILALYNT 
ALKPGHAAAFSEDCDSASTKWIRGAGNQYNYIGYLDYKKERIRKLSMKASTCSFNPGVFVA 
1^TEWKRQNITNQLEKWMKLNVEEGLYSRTLAGSITTPPI»LIVFYQQHSTIDPMWNVRHI/jS 
S AGKRYS PQFVKAAKLLHWNGHLKPWGRTAS YTDVWEKWY I PDPTGKFNLI RRYTE I SNI K 






FIGURE 106 



TGGTTTTTGCCCCATAAATTCCCTCAGCTTGAGCAGTTTGTTAAGGAATGAGGTTA<^GATT 
C^GGAATTNTAGGNCCTCTACCTNTAGANTTTGTCCCAAATGTTCT 
GGGAGACAAGAGGAGATTCCTGTGGTCATCGCnXK!ATOT 
TGCAGCTATAAAGAGCATTCAGCACAACA.CTCGNTCCAATGTGATTTTCTACA 

TC^CAATACAGCAGACCATNTCCGGTCCTGGNTCAACAGTGATTCCCTGAAA 
TAC^AAATTGTCAATTTTGACCCTAAACTTTT 

GGGGGAATCCATGAAACCTTTAACCTTTGCAAGGTTCTACTTGC 

CAAAGAAGGCCATATAC^TGGATGATGATGTAATTGTGC^GGTGATATTCTTGCCCTTTAC 

AATACAGCACTGAAGCCAGGACATGCAGCTGCATTTTCAGAA 

TAAAGTTGTCATCCGTGGAGCAGGAAA 



FIGURE 107



CGACGCTCTAGCGGTTACCGCTGCGGGCTGGCTGGGCGTAGTGGGGCTGCGCGGCTGCCACG 

GAGCTAGAGGGCAAGTGTGCTCGGCCCAGCGTGCAGGGAACGCGGGCGGCCAGACAACGGGC 

TGGGCTCCGGGGCCTGCGGCGCGGGCGCTGAGCTGGCAGGGCGGGTCGGGGCGCGGGCTGCA 

TCCGCATCTCCTCCATCGCCTGCAGTAAGGGCGGCCGCGGCGAGCCTTTGAGGGGAACGACT 

TGTCGGAGCCCTAACCAGGGGTGTCTCTGAGCCTGGTGGGATCCCCGGAGCGTCACATCACT 

TTCCGATCACTTCAAAGTGGTTAAAAACTAATATTTATATGACAGAAGAAAAAGATGTCATT 

CCGTAAAGTAAACATCATCATCTTGGTCCTGGGCTGTTGCTCTCTTCTTACTGGTTTT^ 

CATAACTTCCTCAGCTTGAGGCAGTTTGTTAAGGAATGAGGTTACAGATTC^GGAATTGTAG 

GGCCTC^CCTATAGGACTTTGTCCCAAATGCTCTCCGACATGCAGTAGATGGGAGACAAGA 

GGAGATTCCTGTGGTCATCGCTGCATCTGAAGACAGGCTTGGGGGGGCCATTGCAGCTATAA 

ACAGCATTCAGCACAACACTCGCTCCAATGTGATTTTCTACATTGTTACTCTCAACAATACA 

GCAGACCATCTCCGGTCCTGGGCTCAACAGTGATTCCCTGAAAAGCATCAGATACAAAATTG 

TCAATTTTGACCCTAAACTTTTGGAAGGAAAAGTAAAGGAGGATCCTGACCAGGGGGAATCC 

ATGAAACCTTTAACCTTTGCAAGGTTCTACTTGCCAATTCTGGGTTCCCAGCGCAAAGAAGG 

CCATATACATGGATGATGATGTAATTGTGC^GGTGATATTCTTGCCCTTTACAATACAGCA 

CTGAAGCCAGGACATGCAGCTGCATTTTCAGAAGATTGTGATTCAGCCTCrrACTAAAGTTGT 

CATCCGTGGAGCAGGAAACCAGTACAATTACATTGGCTATCTTGACTATAAAAAGGAAAGAA 

TTCGTAAGCTTTCCATGAAAGCCAGCACTTGCTCATTTAATCCTGGAGTTTTTGTTGCAAAC 

CTGACGGAATGGAAACGACAGAATATAACTAACCAACTGGAAAAATGGATGAAACTCAATGT 

AGAAGAGGGACTGTATAGCAGAACCCTGGCTGGTAGCATCACAACACCTCCTCTGCTTATCG 

TATTTTATCAA<^GCACTCTACCATCGATCCTATGTGGAATGTCCGCCACCTTGGTTCCAGT 

GCTGGAAAACGATATTCACCTCAGTTTGTAAAGGCTGCCAAGTTACTCCATTGGAATGGACA 

TTTGAAGCCATGGGGAAGGACTGCTTCATATACTGATGTTTGGGGAAAAATGGTATATTCCA 

GACCCAACAGGCAAATTCAACCTAATCCGAAGATATACCGAGATCTCAAACATAAAGTGAAA 

CAGAATTTGAACTGTAAGCAAGCATTTCTCAGGAAGTCCTGGAAGATAGCATGCGTGGGAAG 

TAACAGTTGCTAGGCTTCAATGCCTATCGGTAGCAAGCCATGGAAAAAGATGTGTCAGCTAG 

GTAAAGATGACAAACTGCCCTGTCTGGCAGTCAGCTTCCCAGACAGACTATAGACTATAAAT 

ATGTCTCCATCTGCCTTACCAAGTGTTTTCTTACTACAATGCTGAATGACTGGAAAGAAGAA 

CTGATATGGCTAGTTCAGCTAGCTGGTACAGATAATTCAAAACTGCTGTTGGTTTO 
GTAACCTGTGGCCTGATCTGTAAATAAAACTTACATTTTTCAATAGGTAAAAAAAAAAAAAA 



AAAAAA 



FIGURE 108

CTGCAGGTAGACIATCTCC^CTGCCCAGGAATCACTGAGCGTGCAGACAGCACAGCCTCCTCT 
GAAGGCCGGCCATACCAGAGTCCTGCCTCGGCATGGGCCTCACCATTGAGGCAGCTCCACTG 

TCTGTGCTGGTCTGAGGGTGCTGCCTGTCATGG^ 

GCCATCGTCTGCAACGGTCTCGTGGGCTTCTTGCTGCTGCTGCTCTGGGTCATCCTCTGCTG 
GGCCTGCCATTCTCGTCTGCCGACGTTGACTCTCTCTCTGAATCCAGTCCCAACTCCAGCCC 
TGGCCCCTGTCCTGAGAAGGCCCCACCACCCCAGAAGCCCAGCCATGAAGGCAGCTACCTGC 
TGCAGCCCTGAAGGCCCCTGGCCTAGCCTGGAGCCC^GGACC^GTCCAGCTCACCTAGAG 
CCTGGAATTAGGATCCCAGAGTTCAGCCy^CCTGGGGTCCAGAACTCAAGAGTCCJGCCTGCT 
TGGAGCTGGACCC^GCGGCCCAGAGTCTAGCCAGCTTGGCTCCAATAGGAGCrC^GTGGCCC 
TAAGGAGATGGGCCTGGGGTGGGGGCTTATGAGTTGGTGCTAGAGCCAGGGCCATCTGGACT 
ATGCTCCATCCCAAGGGCCAAGGGTCAGGGGCCGGGTCCACTCTTTCCCTAGGCTGAGCACC 
TCTAGGCCCTCTAGGTTGGGGAAGCAAACTGGAACCCATGGCAATAATAGGAGGGTGTCCAG 
GCTGGGCCCCTCCCCTGGTCCTCCCAGTGTTTGCTGGATAATAAATGGAACTATGGCTCTAA 

AAAAAAAAAAAAAAAAA 




FIGURE 109

MGAAISQGALIAIVCNGLVGFLLLLLVTV'ILCWACHSRLPTLTLSIjNPVPTPALAPVLRRPHH 
pi^PAMKAATCCSPEGPWPSLEPRT 



FIGURE 110

GTTTGAATTCCTTCAACTATACCCACAGTCCAWVAGCAGACTCACTG 

GTTCCTCCAAGCAAGTCATTTCCCTTATTTAACC(mTGTGTCCCTCAAA 

CTCCCTATTTGCATCTGTTTTGATAAATGATGTO 

TCATGTCGGGAAGAGATACAATCCTTGGCCTGTGTATCCT 
ATGATGTTTACCTT^GATTCATCACCACCCTTC 

TTTGGGATTGTTGTTTGTCTGCGGT6TTTTATGGTGGCTGTATTATGACTATA 

TCAGCATAGAATTGGACACAGAAAGGGAAAATATGAAGTG^TGC^^ 

TCCACAGGCATCACGGCAGTGCTGCTCGTCTT^^ 

GAC^GTTGAGCTTTTCCAAATCACAAATAAAGC^^ 

AGCCACTGTGGAC^TTTGCCATCCTCATTTTCITCTGGGTCOT 

agcctgggaactgcaggagctgcccagot^^ 

TTCGGGCATTCC^TACATGTGGTCGTACCATTTAATTGGCCT(^TCT^ 
TCCTTGTCTGCCAGCAAATGAC^^^ 

AAAAATGATCCrCCTGATCATCCCATOCTTTCGTCTCTCTCCATTCTCT^ 
AGGAACCGTTGTGAAAGGGTCATTTTTAATCTCTGTG^ 

TGTACATGCAAAACGCACTGAAAGAACAGCAGCATGGTGCATTGTCCAGGTACCTGT^^ 
TGCTGCTACTGCTGTTTCTGGTGrCTTGACAAATAC<nX3CTC<^ 
-TACTACAACI^ATTAALTGGGAtAGATTTCT^ ~ 

TGTCC^AGAACTCAAGTCACTTTACATCTATTAACTGCTTTGGAGACTTCATAATTTTTCTA 
GGAAAGGTGTTAGTGGTGTGTTTC^CTGTTTTTGGAGGACT 

ATAGTTTTTTATCTGTGTTTGAAACTGTGCTGGATGCACTTTTCCT 
GTGGAAACAAATGATGGATCGTCAGAAAAGCCCTACTTTATGGATCy^ 
CGTAAAAAGGAGCAACAAATTAAAC^TGCAAaSGCACAGC^^ 
AT(3AGGAGGGAACAGAACTC<*GGCCA^ 

GGAAAACATTTCCITCTAAGAGCCATTTACAGAATAGAAGATGAGACCACTAGA 
AGTGAATTTTTTTTTAAAAGACCTAATAAACCCTATTCTTCCTCAAAA 



FIGURE 111

MS GRDT I LGLC I LALALSLAMMFTFRF I TTLLVH I F I S LV I LGLLFVCGVLWWLYYD YTNDL 
S I ELDTERENMKCVLGFAI VSTGI TAVLLVL I FVLRKR I KLTVELFQ I TNKAI S S APFLLFQ 
PLWTFAILIFFWVLWVAVLLSLGTAGAAQVMEGGQVEYKPLS6IRYMWSYHLIGLIWTSEFI 
IACQQMTIAGAVVTCYFNRSKNDPPDHPILSSLSILFFYHO^TVVKGSFLISVVRIPRIIVM 
YMQNALKEQQHGALSRYLFRCCYCCFWCLDKYLLHLNQNAYTTTAINGTDFCTSAKDAFKIL 
SKNSSHFTSINCFGDFIIFLGKVLWCFTVFGGLMAFNYNRAFQVWAVPLLLVAFFAYLVAH 
SFLSWETVLDALFLCFAVDLETNDGSSEKPYFMDQEFLSFVKRSNKLNNARAQQDKHSLR^ 

EEGTELQAIVR 




FIGURE 112 

TA^G^CTGTTGTTCTCACTATGAAGGCATCTGTTATTGAAATGTTCCTTCT^^ 
TGACTGGAGTACATTCAAACAAAGAAA<^^ 

CCTCAGATQACTGCGATGTCAAAGCCGGAAAGATCATCGA^CCTGAGTTC 
T C CAG CAGGATGCCAAGACC CCAAATACCATGTTTATGGCACTCACGTOTAT^ 
CCAGTGTGTGTGGCGCTXSCCGTAC^^ 

GTTQK3AAGGTTGCTGGACAGTCTGGTTACAAAGGGAGTTA 
ATCCCTACCACGATGGAGAGAATCCTTTATCGTCTTAGAAAGTAAAC 

cctacccatcagctcttacatactcatc^tcgaaaagt^ 
gcttctggctgtcactgtagctgtggccacccccaccaccttgcSa^c^at^ccttctc 

CTGCTTCTACCACCAGCATCCCCAGACCACAATCAGTG^ 
CTCTGGTCCACTGCCACCTACA^ 

AA<^CAAGATCCTTCAGGAGCTCCCTTCCAGAAACCTCTTGGAGC^ 

^ttccaaaagaagaattgagcacacagtctttggagcc^ 

AATCCAGAAGCAGCTCCTGGCTGATGTTGCCCAAGCTCT^ 

TC^TGGGTGTTGTCCAGTATGGAGA<^CCCTCCTACTCA^AAOTC^^^S 

aattctcgagatctgaagacagcca^^ 

aggtcgggccatctcctttgtgaccaagaactct^ 

^ctcccaatgtggtggi^t^^ 

TCAAC^CTTGCGAGAGAGTCAGGAATCAACATTTTCTTC^^^ 

gcttctactcgctcc^cgtgcagagctggtttggcctcc^ca^accctgS^ 

TGGCTTCGTC^TCGAOSGCTCCAGCAGTG^^ 
TTGTGACCAACCTCACCAAAGAGTTTGAGATTTCCGACAC 

cagtacacctacgaacagcggctggagtttgggttcgacaa^^^ 

?^^ C ^ T ^ GAGG GTGGGCra^ 



™^ tgagccaaaagg ctacatcatg™ 

gttttcaaaataaatgttcggaatacagtgcagcccttacgacaggot 

gtgagatttttaagttgttatttctgatttgaactctc^ 

gtcatgacaatgtaggaattgctgaattaaatgtttagaa^aS 

AAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAA^ 
AAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAA^^ 



FIGURE 113

MRTVVLTMKASVIEMFLVLLVTGVHSNKETAKKI KRPKFTVPQINCDVKAGKI IDPEFI VKC 
PAGCQDPKYHVYGTDVYASYSSVCGAAVHSGVLDNSGGKILVRKVAGQSGYKGSYSNGVQSL 

SLPRVHIESFIVLESKPKKGVTYPSALTYSSSKSPAAQAGETTKAYQRPPIPGTTAQPVTLMQ 

L1AVTVAVATPTTLPRPSPSAASTTSIPRPQSVGHRSQEMDLWSTATYTSSQNRPRADP6IQ 

RQDPSGAAFQKPVGADVSLGLVPKEELSTQSLBPVSLGDPNCKIDLSFLIDGSTSIGKRRFR 

IQKQLLADVAQALDIGPAGPLMGWQYGDNPATHFinJCTHTNSRDLKTAIEKITQRGGLSW 

GRAISFVTKNFPSKANGNRSGAPNVVVVMVDGWPTDKVEEASRIjARESGINIFFITIEGAAE 

NEKQYVVEPNFANKAVCRTNGFYSLHVQSWFGLHKTWPLVKRVCDTDRLA 

GFVITCSSSVGTOTFRTVLQFVTNLTKEFEISDTDTRIGAVQYTYEQRL^ 
I^AIKRVGYWSGGTSTGAAINFALEQLFKKSKPNKRKLMILITDGRSYDDVRIPAMAAHLKG 

VITYAIGVAWAAQEELEVIATHPARDHSFFVDEFDNLHQYVPRIIQNICTEFNSQPRN 



FIGURE 114 

CAGGATGAACTGGTTGCAGTGGCTGCTGCTGCra^ 
^CTTTCCGGCTGCTCATCCOSCrCGGCCTCCTGTC 

GCGCCAGGTCCCGACGGCTCCGCGCCAGATCCCGCCC^CrACAGTTTTTCTCTC 

TOATGCACTGGACACCTTGCTGATTTTGGGGAATGTCT^ 

TGCTCCAGGACAGCGTGGACTTTGATATTGATGTGAACGCCTCT 

CGAGTGGTAGGAGGACTCCTGTCTGCTCATCTGCTCTCCAAGAAGGCTGGGGTGGAAGTAGA 

GGCTGGATGGCCCTGTTCCGGGCCTCTCCTGAGAATGGCTGAGGAG 

TCCC^GCCTTTCAGACCCCCACTGGCATGCCATATGGAACAGTGAACTTACI^CA 

AACCC^GGAGAGACCCCTGTCACCTGTACGGCAGGGATTGGGACCTT 

CACCCTGAGCAGCCTCACTGGTGACCCGGTGTTCGAAGATGTGGCCAGAGTGGCTTTGATGC 

GCCTCTGGGAGAGCCGGTCAGATATCGGGCTGGTCGGC^CCACATTGATGTGCTCACTGGC 

AAGTGGGTGGCCCAGGACGGAGGCATCGGGGCTGGCGTGGACTCCTACTTTGAGTACTTGGT 

GAAAGGAGCCATCCTGCTTCAGGATAAGAAGCTCATGGCCATGTTCCTAGAGTATAACAAAG 

CCATCCGGAACTACACCCGCTTCGATGACTGGTACCTGTGGGTTCAGATGTACAAGGGGACT 

GTGTCC^TGCCAGTCTTCCAGTCCTTGGAGGCCTACTGGCCTGGTCTTCAGAGCCTCATTGG 

AGACATTGACAATGCCATGAGGACCTTCCTCAACTACTACACTGTATGGAAGCAGTTTGGGG 

GGCTCCCCGAATTCTACAACATTCCTCAGGGATACACAGTGGAGAAGCGAGAGGGCTACCCA 

CTTCXXSCCAGAACTTATTGAAAGCGCAATGTACCTCTACCGTGCCACGGGGGATCO^ACCCT 

CCTAGAACTCGGAAGAGATGCTGTGGAATCCATTGAAAAAATCA.GCAAGGTGGAGTGCX3GAT 

TTGCAACAATaUUU3ATCTGCGAGACCACAAGCTGGACAACCGCATGGAGT<^TTCTTCCTG 

GCCGAGACTGTGAAATACCTCTACCTCCTGTTTGACCCAACOACTTCATCCACAACAATGG 

GTCCACCTTCGACGCGGTGATCACCCCCTATGGGGAGTGCATCCTGGGGGCTQGGGGGTACA 

TCTTCAACACAGAAGCTCACCCCATCGACCTTGCCGCCCTGCACTGCTGCCAGAGGCT 

GAAGAG^GTGGGAGGTGGAGGACrrGATGAGGGAATTCTACTCTCT 

GAAATTTCAGAAAAACACTGTTAGTTaSGGG^ 

TCTTCTCACCAGAAAACCATGACCAGGCAAGGGAGAGGAAGCCTGCCAAACAGAAGGTGCCA 
CITCTCAGCTGCCCCAGTCAGCCCTTCACCTCCAAGTTGGCATTACTGtSGA 

AGACTCCTCATAACCACTGGATAATTTTTTTATTTTTATTTTTTTGAG<3CTAA^ 
AATTGCTTTTGGCTATCATAAAA 



FIGURE 115

MPFRLLIPLGLLCALLPQHHGAPGPDGSAPDPAHYSFSLTLIDALDTLLILGNVSEFQRWE 
VLQDSVDFDIDVNASVFETNIRWGGLLSAHLLSKKAGVEVEAGWPCSGPLLRMAEEAARKL 
LPAFQTPTGMPYGTVNLLHGVNPGETPVTCTAGIGTFIVEFATLSSLTGDPVFEDVARVALM 
RLWESRSDIGLVGNHIDVLTGKWVAQDAGIGAGVDSYFEYLVKGAILLQDKKLMAMFLEYNK 

AIRNYTRFDDWyLWVQMYKGTVSMPVFQSLEAYWPGLQSLIGDIDNAMRTFLNYY^ 

GLPEFYNIPQGYTVEKREGYPLRPELIESAMYLYRATGDPTLLELGRDAVESIEKISKVECG 

FATIKDI^DHKLDN^ESFFIAETVKYLYLLFDPTNFIHNNGSTFDAVITPYGECILGAGGY 

IFNTEAHPIDLAALHCCQRLKEEQWEV^IMREFYSLKRSRSKFQKNTVSSGPWEPPARPOT 

LFSPENHDQARERKPAKQKVPLLSCPSQPFTSKLALLGQVFLDSS 




FIGURE 116 

AAAGTTACATTTTCTCTGGAACTCTCCT 

GGCAGAAAGGAGGGTGCTTCGGAGCCCGCCCTTTCT 

ATTCAGGCTTCGCTGCGACTCAGACCTCA^ 

GAGATGGACAGAATGCTTTATTTTGGAAAGAAA 

AATGCAGACTTTCACAATGGTTCTAGAAGAAATCTGGACA^ 

ACGCATTGATTCCATGTTTGCTCACAGATGAAGTGGCCATTCTGCCTGCC 

TCTGTACTCTCAACCAACATGAAGCATCTCTTGATGTGGAGC 

AACAGTGTACTATTCTGTCGAATACCAGGGGGAGTACGAGAGCCTGTACA 

GGATCCCCAGCAGCTGGTGCTCACTGACT 

ACGGCCACTGTGCCATACAACCTTCGTGTCAGGGCCACATTGGGCT 

GAGCATCCTGAAGCATCCCTTTAATAGAAACTCAACCATCCTTACCCGACCTGGGATTC 

TCACCAAAGATGGCTTCCACCTGGTTATTGAGCT 

CTTGTGGCCTACTGGAGGAGGGAGCCTGGTGCCGAGGAACATGTC?^AAATGG 

GGGTATTCCAGTGCACCTAGAAACCATGGAGC^ 

CATTCGTGAAGGCCATTGGGAGGTACAGCGCCTTC^^ 

GGAGAGGCCATTCCCCTGGTACTGGCCC^ 

GGTCGTGCCACTGTTCGTCTGGAAAATGGGCCGGCT 

TGGTCCTCCCAGACACCTTGAAAATAACCAATTC^CCCCAGAAGTTAATCAGCTGCAGAAGG 

GAGGAGGTGGATGCCTGTGCCACGGCTGTGATGTCTCCTGAGK5AACTCCTCAG 

CTCAIT^TTTGCGGAAGGGCCCAGGTGAAGCCGAG 

ATG^GGGGACAAGTTGTGTTTCTGTTTTCCGCCACGGACAAGGGATGAGAGAA 
GCCTGTTGTCTACAAGTCTAGAAGCAACCATCA 
TGACTGAGGCTTAGGGGATGTGACCTCTAGACTGGQ 
CTGGGAAAAGTGACTTCATCCCTTCGGTC^ 

TACACACCTGCTA^CAC^CACACACAGAGTCTCTCTCTATATATACACACGTACA^ 

TAC^CCCAGCACTTGCAAGGCTAGAGGGA^^ 

TGTTTCTGGAGAGC^GGACATAAATGTATGATC 

GGOTTGGAGAGCCCACTTTCCCAGAATAAT<XI!TTGAGA 

TGTTGAGTTCACTTCAAGCCCAATGCOGGTC 

AGGTGACCTGGAGGAAGGTC&CAGCCACACTG^ 

CATGAACTACTGTAAAGTGTTGACAGTGTCTGCACACTGC^ 

GTGCAATGCGACGAGAATGCAGAAGTCAGTAACATGTC 

TGTTGGTAAAGTACAGAATTCAGCAAATAAAAA 

AAAAAAAAAA 



FIGURE 117

MQTFTMVLEEIWTSLFMWFFYALI PCLLTDEVAI LPAPQNLS VLSTNMKHLLMWSPVI APGE 
TVYYSVEYQGEYESLYTSHIWIPSSWCSLTEGPECDVTDDITATVPYNLRVRATWSQTSAW 
SILKHPFNRNSTILTRPGMEITKIXSFHLVIELEDIjGPQFEPLVAYWRREPGAEEHVKMVRSG 

GIPVHLETMEPGAAYCVKAQTFVKAIGRYSAFSQTECVEVC^ 
WPLFVWKMGRliQYSCCPVVVLPim.KITNSPQKLISCRREEVDACATAVMSPEE 



Important features:

Signal peptide:
amino acids 1-29

Transmembrane domain:
amino acids 230-255

N-glycosylation sites:
amino acids 40-43 and 134-137

Tissue factor proteins homology:
amino acids 92-119

Integrins alpha chain protein homology:
amino acids 232-262
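The N-glycosylation sites listed above are reported as four-residue windows. That is consistent with the commonly used consensus motif N-{P}-[ST]-{P} (asparagine, any residue except proline, serine or threonine, any residue except proline). The sketch below assumes that motif; the source does not state which pattern or tool produced the annotation. The function name `find_n_glyc_sites` is hypothetical, and the input is only the first printed line of the Figure 117 polypeptide with the OCR spaces removed.

```python
import re

# First printed line of the Figure 117 polypeptide (62 residues), with the
# spaces introduced by text extraction removed.
FIG117_PREFIX = "MQTFTMVLEEIWTSLFMWFFYALIPCLLTDEVAILPAPQNLSVLSTNMKHLLMWSPVIAPGE"

# Assumed consensus motif N-{P}-[ST]-{P}; the lookahead lets overlapping
# candidate sites be reported as well.
N_GLYC_MOTIF = re.compile(r"(?=(N[^P][ST][^P]))")


def find_n_glyc_sites(seq):
    """Return 1-based inclusive (start, end) windows matching the motif."""
    return [(m.start() + 1, m.start() + 4) for m in N_GLYC_MOTIF.finditer(seq)]


sites = find_n_glyc_sites(FIG117_PREFIX)
```

On this prefix the scan reports one window, residues 40-43 (NLSV), which agrees with the first site in the list; the second listed site lies beyond the 62 residues used here.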
FIGURE 118



TCCTGCTGATGCACATCTGGGTTTGGCAAAAGGAGGTTC 

CCTGGCCGGCTCTAGAACAATTCAGGCTTCGCTGC^ 

TTCTGAAGAAAGATGGCTGAGATGACAGAATGCTTTATTTTGG 

TCAAACTGAGTCTACCAAATGCAGACTTTGAC 

TTTCATGTGGTTTTTCTACGCATTGATTCCA^ 

CTGCCCCTCAGAACCTCTCTGTACT 

GTCATCGOGCCTGGAGAAACAGTGrrACTATTCTGTCGAATACCAGKSGGGA 

GTACACGAGCCACATCTGGATCCCCAGC^GCT^ 

ATGTCACTGATGACATCACGGCCACTGTGCCATAC^ 

TGACAGACCTCAGCCTGGAGCATCCTGAAGCATCCCTTTAATAGAAACT 

CCGACCTGGGATGGAGATCACCAAAGATGGCTTNCACCTGGTTATTGAGCT 

GGCCCCAGTTTGAGTTCCTTGTGGCCTANTGGAGG 

GTTNGOJAACCCCTTGCGGCC 

ATACTCAATATGGACGAANTGCTATTGTCCACCT 



FIGURE 119

CGGACGCGTGGGCCGCCACCTCCGGAACAAGCC^TSGTGGCCSGCGACGGTGGCAGCGGCGTG 

GCTGCTCCTGTGGGCTGCGGCCTGCGCGCAGCAGGAGCAGGACTTCTACGACTTCAAGGCGG 

TCAACATCCGGGGCAAACTGGTGTCGCTGGAGAAGTACCGCGGATCGGTGTCCCTGGTGGTG 

AATGTGGCCAGCGAGTGCGGCTTCACAGACCAGCACTACCGAGCCCTGCAGCAGCTGCAGCG 

AGACCTGGGCCCCCACCACTTTAACGTGCTCGCCTTCCCCTGCAACCAGTTTGGCCAACAGG 

AGCCTGACAGCAACAAGGAGATTGAGAGCTTTGCCCGCCGCACCTACAGTGTCTCATTCCCC 

ATGTTTAGCAAGATTGCAGTCACCGGTACTGGTGCCCATCCTGCCTTCAA6TACCTGGCCCA 

GACTTCTGGGAAGGAGCCCACCTGGAACTTCTGGAAGTACCTAGTAGCCCCAGATGGAAAGG 

TGGTAGGGGCTTGGGACCCAACTGTGTCAGTGGAGGAGGTCAGACCCCAGATCACAGCGCTC 

GTGAGGAAGCTCATCCTACTGAAGCGAGAAGACTTA^AACCACCGCGTCTCCTCCTCCACCA 

CCTCATCCCGCCCACCTGTGTGGGGCTGACCSVATGCAAACTC^AATGGTGCTTCAAAGGGAG 

AGACCCACTGACrrCTCCTTCCTTTACTCTTATGCCATTGGTCCCATC^TTCTTGTGGGGGAA 

AAATTCTAGTATTTTGATTATTTGAATCTTACAGCAACAAATAGGAACTCCTGGCCAATGAG 

AGCTCTTGACCAGTGAATCACCAGCCGATACGAACGTCTTGCCAACAAAAATGTGTGGCAAA 

TAGAAGTATATCAAGCAATAATCTCCCACCCAAGGCTTCTGTAAACTGGGACCAATGATTAC 

CTCATAGGGCTGTTGTGAGGATTAGGATGAAATACCTGTGAAAGTGCCTAGGCAGTGCCAGC 

CAAATAGGAGGCATTCAATGAACATTTTTTGGATATAAACCAAAAAATAACTTGTTA 

AAAAACTTGGATCCAACATGAATTTCCAGCCGATGATAATCCAGGCCAAAGGTTTAGTTGTT 

GTTATTTCCTCTGTATTATTTTCTTCATTACAAAAGAAATGCAAGTTCATTGTAACAATCCA 

AACAATACCTCACGATATAAAATAAAAATGAAAGTATCCTCCTCAAAAA 




FIGURE 120 

MVAAWAAAWLLLWAAAGAQQEQDFYDFKAVNIR^ 

HYRALQQI^RDLGPHHFNVl^PCNQFGQQEPDSNKEIESFARRTYSVSFPMFSKIAVTGTC 
MPAFKYIiAQTSGKEPTWNFWKYLVA^ 



FIGURE 121

CGGACGCGTGGGCGGGCCGGGACGCAGGGCAAAGCGAGCCATSGCTGTCTACGTCGGGATGC 

TG CGCCTGGGG AGGCTGTGCGCCGGGAGCTCGGGGGTGCTGGGGGC C CGGGC CGCCCT CTCT 

CGGAGTTGGCAGGAAGCCAGGTTGCAGGGTGTCCGCTTCCTCAGTTCCAGAGAGGTGGATCG 

CATGGTCTCCACGCCCATCGGAGGCCTCAGCTACGTTCAGGGGTGCACCS^AAAAGCATCTTA 

AC^GCAAGACTGTGGGCCAGTGCCTGGAGACCACAGCACAGAGGGTCCCAGAAGGAGAGGCC 

TTGGTCGTCCTCCATGAAGACGTCAGGTTGACCTTTGCCCAACTCAAGGAGGAGGTGGACAA 

AGCTGCTTCTGGCCTCCTGAGCATTGGCCTCTGCAAAGGTGACCGGCTGGGCATGTGGGGAC 

CTAACTCCTATGCATGGGTGCTCATGCAGTTGGCCACCGCCCAGGCGGGCATCATTCTGGTG 

TCTGTGAACCCAGCCTACCAGGCTATGGAACTGGAGTATGTCCTCAAGAAGGTGGGCTGCAA 

GGCCCTTGTGTTCCCCAAGCIAATTCAAGACCCAGCAATACTACAACGTCCTGAAGCAGATCT 

GTCCAGAAGTGGAGAATGCCCAGCCAGGGGCCTTGAAGAGTCAGAGGCTCCCAGATCTGACC 

ACAGTCATCTCGGTGGATGCCCCTTTGCCGGGGACCCTGCTCCTGGATGAAGTGGTGGCGGC 

TGGCAGC^CACGGCAGCATCTGGACCAGCTCCAATACAACCAGCAGTTCCTGTCCTGCCATG 

ACCCCATCAACATCCAGTTCACCTCGGGGACAACAGGCAGCCCCAAGGGGGCCACCCTCTCC 

CACTAC^CATTGTCAACAACTCCAAC^TTTTAGGAGAGCGCCTGAAACTGCATGAGAAGAC 

ACCAGAGCAGTTGCGGATGATCCTGCCCAACCCCCTGTACCATTGCCTGGGTTCCGTGGCAG 

GCACAATGATGTGTCTGATGTACGGTGCCACCCTCATCCTGGCCTCTCCCATCTTCAATGGC 

AAGAAGGCACTGGAGGCCATCAGCAGAGAGAGAGGCACCTTCCTGTATGGTACCCCCACGAT 

GTTCGTGGACATTCTGAACCAGCCAGACTTCTCCAGTTATGACATCTCGACCATGTGTGGAG 

GTGTCATTGCTGGGTCCCCTGCACCTCCAGAGTTGATCCGAGCCATCATCAACAAGATAAAT 

ATGAAGGACCTGGTGGTTGCTTATGGAACCACAGAGAACAGTCCCGTGACATTCGCGCACTT 

CCCTGAGGACACTGTGGAGCAGAAGGCAGAAAGCGTGGGCAGAATTATGCCTCACACX3GAGG 

CCCGGATCATGAACATGGAGGCAGGGACGCTGGCAAAGCTGAACACGCCCGGGGAGCTGTGC 

ATCCGAGGGTACTGCGTCATGCTGGGCTACTGGGGTGAGCCTCAGAAGACAGAGGAAGCAGT 

GGATCAGGACAAGTGGTATTGGACAGGAGATGTCGCCACAATGAATGAGCAGGGCTTCTGCA 

AGATCGTGGGCCGCTCTAAGGATATGATCATCCGGGGTGGTGAGAACATCTACCCCGCAGAG 

CTCGAGGACTTCTTTCACACACACCCGAAGGTGCAGGAAGTGCAGGTGGTGGGAGTGAAGGA 

CGATCGGATGGGGGAAGAGATTTGTGCCTGCATTCGGCTGAAGGACGGGGAGGAGACCACGG 

TGGAGGAGATAAAAGCTTTCTGCAAAGGGAAGATCTCTCACTTCAAGATTCCGAAGTACATC 

GTGTTTGTCACAAACTACCCCCTCACCATTTCAGGAAAGATCCAGAAATTCAAACTTCGAGA 

GCAGATGGAACGACATCTAAATCTGTGAATAAAGCAGCAGGCCTGTCCTGGCCGGTTGGCTT 

GACTCTCTCCTGTCAGAATGCAACCTGGCTTTATGCACCTAGATGTCCCCAGCACCCAGTTC 

TCAGCCAGGCACATCAAATGTCAAGGAATTGACTGAACGAACTAAGAGCTCCTGGATGGGTC 

CGGGAACTCGCCTGGGCACAAGGTGCCAAAAGGCAGGCAGCCTGCCCAGGCCCTCCCTCCTG 

TCCATCCCCCACATTCCCCTGTCTGTCCITGTGATTTGGCATAAAGAGCTTCTGTTTTCTTT 

GAAAAAAAAAAAAAAAA 




FIGURE 122 

MAVYVGMLRLGRLCAGSSGVLGARAALSRSWQEARIjQGVRFLSSREVDRMVSTPIGGLSYVQ 
GCTKKHLNSKTVGQCLETTAQRVPEREALVVLHEDVRLTFAQLKEEVDKAASGLLSIGLCKG 
DRLGMWGPNSYAWVLMQLATAQAG I ILVSVNPAYQAMELEYVLKKVGCKALVFPKQFKTQQY 
YNVLKQ I CPEVENAQPGALKSQRL PDLTTVI SVDAPLPGTLLLDEWAAGSTRQHLDQLQYN 
QQPLSCHDPINIQFTSGTTGSPKGATLSHYNIVIWSNILGERLKIjHEKTPEQLRMILPNPLY 
HCLGSVAGTMMCLMYGATLILASPIFNGKKALEAISRERGTFLYGTPTMFVDILNQPDFSSY 
DI STMCGGVI AGSPAPPELI RAI INKINMKDLWAYGTTENS PVTFAHFPEDTVEQKAES VG 
RIMPHTEARIMNMEAGTLAKLNTPGELCIRGYCVMI/SYWGEPQKTEEAV^ 
MNEQGFCKIVGRSKDMIIRGGENIYPAELEDFFHTHPKVQEVQWGVKDDRMGEEICACIRL 

KDGEETTVEE I KAFCKGKISHFKI PKY I VFVTNYPLT I SGK IQKFKLREQMERHLNL 



FIGURE 123

C^CTCCAAC^TTTTAGGAGAGCGCCTGAAACTGCATGAGAAGACACCAGAGCAGTTGCGGA 
TGATCCTOCCCAACCCCCTGTACCATTGCCTGGGTTCCGTGGC^GGCAC^TGATGTGTCTG 
ATGTACG&TGCGACCCTCATCCTGGCCTCTCCCATCTT 

CATCAGC^GAGAGAGAGGCACCTTCCTGTATGGTACCCCCACGATGTTCXSTGGACATTCTGA 
ACCAGCCAGACTTCTCCAGTTATGACATCTCGACCATGTGTGGAGGTGTCATTGCTGGGTCC 
CCTGCACCTCCAGAGTTGATCCGAGCCATCATCAACAAGATAAATATGAAG6ACCTGGTQGT 
TGCTTTATGGAACCACAGAGAACAGTCCCGTGACATTCGCGCACTTCCCTGAGGACACTGTGG 
AGCAGAAGG(^GAAAGCGTGGGCAGAATTATGCCTCACACGGAGGCGCGGATCATGAACATG 
GAGGGAGGGACXSCTGGCAAAGCTGAAC^CGCCCGGGGAGCTGTGC^ 

CATGCTGGGCTACTGGGGTGAGCCTCAGAAGACAGAGGAAGCAGTGGATCAGGACAAGTGGT 
ATTGGACAGGAGATGTCGCCAC 




FIGURE 124 

GAGCAGGACGGAGCCATGGACCCCGCCAGGAAAGCAGGTG 
AGGCTGGCTGCTGCTGCTGCTGCTTCGCGGAGGAGCGCAGGCCCTGGAGTGCT 
TGCAGAAAGCAGATGACGGATGCTCCCCGAACAAGATGAAGA 
GTGGACGTCTGCACCGAGGCCGTGGGG 

AGTGCGGGGTTGCGGTTCGGGACTCCCCGGCAAGAATGACCGCX3GCCTGGATCTT 

TTCTGGCGTTCATCCAGCTGCAGCAATGCGCTC^ 

ACCTCGCGGGCGCTCGACCCGGCAGGTAATGAGAGTC 

CTACAGOTGTGTGGGCCTGAGCCGGGAGGCGTGCCAGGGTACATCGCCGCCGGTCGTG^ 

GCTACAACGCCAGCGATGATGTCTACAAGGGCT 

GCTAATGTGACTGTGTCCTTGCCTGTCCGGGGCT^ 

TGGAGTAACAGGCCCAGGGTTCACGCTCAGTGGCTC^ 

CTGACCTCCGCAACAAGACCTACITCrCCCCTCGAATC 

CCAGAGCCCACGACTGTGGCCTCAACCACATCTC 

ACCCAC^TCCACCACCy^CCCATGCCAGCGCCAACCAGTC^GACT 

AACACGAGGCCTCCCGGGATGAGGAGCCCAGGTTGACTGGAGGCGCOT 

CGCAGCAATTCAGGGCAGTATCCTGCAAAAGG^ 



^GCTTCTCCACCTGGAAATTTCCCTCT^ 

CATCACTTCCTGTTCCCACCACTGGACTGG^ 

CAGTATCCCX!AGCTTCTGCTGCGCTGG1TTGCGGCTTTC 

ATTCTGCCAGGGGTGTTCTAGCTTTTTGAGGACAGCTCCTGTATCCTTCTCATC 

TCCGCTTGTCCTCTTGTGATGTTAGGACAGAGTGAGAGAAG 

AGAGAGAGGATGCTAAGCTTCCTACTCACTTT^ 

GGTGGGTGGGACAATGGCT€CC<^CTCTAAGCACTGCCTC€CCTACT 

GAATCGGTTCCCCATATGTCTTCCTTACT 

CCAATTCGCCCTATAGTGAGTCGTA 




FIGURE 125 

fnjPARKAGAQAMIWTAGWLLLLLLRGGAQALECT^ 
BAVOAVETIHGQFSIAVRGCG^^ 

DPAG1TOSAYPPNGVECYSCVGLSREACQGTSPPWSCYNASDHVYKGCFDGNVTLTAANVTV 
SLPVRGCVQDEFCTRDGVTGPGFTLSGSCCQGSRCNSDLRNKTyFSPRI PPLVRLPPPEPTT 
VASTTSVTTSTSAPVRPTSTTKPMPAPTSQTPRQGVEHEASRDEEPRLTGGAAGHQDRSNSG 

QYPAKGGPQQPHNKGCVAPTAGLAALLLAVAAGVLL 




FIGURE 126 

CGGGACTCGGCGG^TCCTCCTGG^ 
TGGTGCTGGTCTTCCTCTGCAGCCTGCTGGCC^ 

GAGAAGGAAATGGACCCTTTTCATTATGATTACCAGACCCTGAGGATT 

CGCTGTGGTCCTCTTCTCGGTTGGGATCCTCCTTATCCTAAGTCGCAG 

TCAATCAGAAGCCCCGGGCCCCAGGAGATGAGGAAGCCCAGGTGGAGAACCTCAT 

AATGCAACAGAGCCCCAGAAGCAGAGAACTGAAGTGCAGCCATC^ 

CCTGAGGCXMCTGCTTGAACCTTTGGATGCAAATGTCGATG 

AGCAACAGCCCTTTCCCCAGGAGAAGCCAAGAACTTGTGTC 

ACACCATTCCTCCACCTGATGATGCAACrAA 

GCCCACCTCCCGTGATGTGTGTGTGTGTGTGTGTGTGT^ 

GTCTTTGTGGCTACnTGTTTGTGGATGGTATTGTGTTTGTTAGTGAACT 

CCCAGGCAGGGGCTGAGCCACATGGCCATCTGCTC^ 

CTTCTGCTCCTAGGAGGCTGCTTGTTGCCCGAGACCAGCCCCCTCCCCTGATTTAGGG 
GTAGGGTAAGAGCACGGGCAGTGGTCTTCAGTCGTCTTGGGACCTGGGAAGGT^ 
TTTGTCATCATTCTTCATGGACTCCrTTCACTCCIT^ 
ACCTGATCCCAGTCTGAAGGTCTCTTAGCA^ 

CAGCGTTGACGTCAGGCAGGCTATGC<:CTTCCGTGGTTAATTTCTTCCCAGG 
AGGAGTCCCCATCTGCCCCGCCCCTTCACAGAGTO 

ACTCTGCXTCCTGGGGAATGTGTCCCCTGCATATCTTCTCAGCAATAACTCCATC 
GACCCTACCCCTTCCAACCTTCCCTGCTTCTGAGA^ 

ATGC^GACTACAGTCCCTGCAATTGGGTCTCTGGCAGGCAATAGTTGAAGGACTCCTGTTCC 
GTTGGGGCCAGCACACCGGGATGGATGGAGGGAGAGCAGAGGCCTTTGCTTCTCTGCCTACG 
TCCCCTTAGATGGGCAGCAGAGGCTIACTCCC^ 
GCGGTGAGCGAGGTGGGTTGGAGACTCAGCAGGCTC 
-GTTGAAGGTCATAACGAGAGTGGGAACTC^^ 
CCCGCGGAAACCAACTO^CCGTGaOT 

CCTCAACAACAACAGAAAAAAGGAATAAAATATCCTTTGTTTCCT 



FIGURE 127

MELVLVPLCSLIAPMVIASAAEKEKEMDPFHYDyQTLRIGGLVPAVVLPSVGILLILSRRCK 
CSFNQKPRAPGDEEAQVENLITANATEPQKQRTEVQPSGGSLWNLRRLLEPLDANVDA 




FIGURE 128 

AAACTTGACGCC^AAGATCCCGGTCCTTCCTGCCGTGGTGCTCCTCTCCCTCCTGGTGCT 
CC^CTCTGCCCAGGGAGCCACCCTGGGT<MTCCTGAGGAAGAAAGCACCATTGAGAATTATG 
CGTCACGACCCGAGGCCTTTAACACX:C<X3TTCCTGAACATCGACAAATTGCGATCTGCGTTT 

aaggctgatgagttcctgaactggcacg<x:ctct 

CCTCAACTGGGATGCCITTCCTAAGCTGAAAGGACTGAGG^^ 

GACCATGACCTCCACTGGAAGAGGGGGCTAGCGTGAGCGCrGATTCTCAACCTACCATAACT 
CTTTCCTGCCTCAGGAACTCCT^TAAAACATITTCCATCXyiAA 



FIGURE 129

MKIPVLPAVVIJjSLLVLHSAQGATLGGPEEESTIENYASRPEAFNTPFLNIDKLRSAFKADE 
FLNWHALFESIKRKLPFLNWDAFPKLKGLRSATPDAQ 



FIGURE 130

CAGTTCTGAAATCAATGGAGTTAAT 

CTTTGCCTCAGTGATTCTCACCTGCCTCT^ 

TTCTCCAGCCAGTTCCZAACTCAGG7VGACAGGTCC 

TTTGCCGGCCACTC ATGAG AGTGTTTTTGTGT^ 

TCATGATTTAATAACCATCCTTTC 

AATTTTTGTTATACTAGATGGCTTCCATTTAC^ 

AGGTTCAAGGTTCATTTGACTTGAGAAAGTGCCCTTCTGGAGCTTCATTC 
TTCACTATTAATTGTAACGATTAAAAAAGAATAAGAGCAC^ 
TATCCCTGGGTGCCCCTGACACATTTATGTAGT^ 
TGTTATTCTAATATTAGTACATTC^ 

AAAAGTTTTGAGTATATTTTTCAACTAGATATTTGTATAGAAAGACTGAATA 



FIGURE 131

MGVEIAFASVILTCLSLLAAGVSQWLLQPVPTQETGPKAMGDLSCGFAGHS 






FIGURE 132 



rp^nni & Tf'TOf'&flT A(^TCrrGCCGGCGATGGAGTGGTGGGCTAGCTCGCC<3CTTCGGCTCTG 

GCTGCTGTTGTTCCrcCTCCC 

AAGTATTTATTGACCAAATTAACAGGTCTTT^ 

TGCAGCTGCTACCATGGTGTCATAGAAGAGGATC 

GAAGATGATGGCAGAGGTAGTCAG ACGGAAGCTAGGGACCCACT AT CAGATCACTAAGAAGA 

GACTGTACCXSGGAAAATGACTGCATGTTCXZCCTCAAGGTGTAGTGGTGTTG^ 

TTGGAAGTGATCGGGCGTCTCCCTGACATGGAGATGGTGAT 

GGTTCCTAAATGGATGGAGCCTGCCATCCCAGTCTTCTCCTTCAGTAAGA 

ATGATATCATGTATCCTGCTTGGACATTTTGGGAAG^ 

CCTACAGGTCTTGGACGGTGGGACCTCTTCAGAGAAGATC 

GCCATGGAAAAAGAAAAACTCTACAGCATATTTCCG 

ATCCTCTC7VTTCTTCTGTCTCGGAAAAACC<2AAA^ 

CAGGCCTGGAAATCTATGAAAGATACCTTAGGAAAGCCAGCTGCTAAGG 

GGATCACTGCAAATACAAGTATCTGTTTAATTTTTO^ 

AAC^CCTCTTCCTGTGTGGCTCACTTGTTTTCCATGTTGGTGATGAGTGGCT 
TATCCACAGCTGAAGCCATGGGTTCACTATATCCCA 

AGAGCTGTTACAATTTGTAAAAGCAAATGATGATGTAGCTCAAGAGATTGCTGAAAG 



GCCAGTTTATTAGGAACCATTTGCAGATGGATGACATC 

AGTGAATACTCTAAATTCCTGTCTTATAATGTAACGAGAAGGAAAGGTTATGATCAAATT 

TCCCAAAATGTTGAAAACTGAACTASI^TAGTCATCATAGGACCATAGTCCTCTTTGT^ 

ACAGATCTCAGATATCCTAOMTGA 

TCTGCTATCAAGCC&AATAGCTGGTTTTCCT 

AAGATTTAAAATGTGTCTAATACACTGATATGAAGCAGTTCAACTTTTTGGATGAATAAGGA 

CCAGAAATGGTGAGATGTGGATTTTGAACGCAACTCTACCTTTCATTTTCTTAAGACCAATC 

ACAGCTTGTGCCTCAGATCATCCACCTGTGTGAGTCCATCACTGTGAAATTGACTC 

TGTGATGATGCX^CTTTGTCCCATTATTTGGAGCA^ 

CTCATTGCTGGAATTGTGAAATTATTCAAGGCGTGATCTCTGTCACT^ 

AAACCCTATGGGGTTTATGAAAAATACTTGGGG 

TAGCCATGCCATGCAATGATGTAGGAGTTCTCTC 

GAGGTTTCTATAATGCCACATAGAAAGAGGCCAATTGCATGAG 

TCAGGTTCCCTTTTTGTGCCIT^ 




FIGURE 133 

MEWWASSPLRLWLLLFLLPSAQGRQKESGSKWKVFIDQINRSLENYEPCSSQNCSCYH 

EDLTPFRGGISRKMMAEVVRRKLGTHYQITKNRLYRENDCMFPSRCSGVE 

MEMVINVRDYPQVPKWMEPAIPWSFSKTSEYHDIMYPAWTFWEGGPAWPIYPTC 

FREDLWSAAQWPWKKKNSTAYFRGSRTSPERDPLILLSRKNPKLVDAEYTKNQAWKSMK^ 

LGKPAAKDVHLVDHCKYIC^FNF.RGVAA^ 

YIPVKTDLSNVQELLQFVKANDDVAQEIAERGSQFIRNH^ 

NVTRRKGYDQ I I PKMLKTEL 




FIGURE 134 

CACCCCTCCATTTCTCGCCATSGCCCCroCACTGCT 

TCATCCTGGCCTTTGGCACCGGAGTGGAGT^ 

GGAGGGATCCGGGAGTCTGGTGGTCCGGATGCCCGCCAGGGATGG 

CCGCAGCATCCTTGCCCCCCTGGCATGGGATCTGGGGCTCCTGCTTCTATTTG 

ACAGCCTCATGGCAGCTGAAAGAGTGAAGGC^^ 

AGGTCACTGTATGTGGCCTGCACTGCCCTGGCCT 

CATACCCAAAGGCCCTGTGTTGTGGGAGGCTCGGGC^ 

TCCTCTGCTTTGTGCTCCATGTCATCTCCT^ 

GACTATCCTGAGCTCATGGGCCTCAAAC^ 

TCTGGCCCTGAAGTCTCCCCGGGCTCTCAGACT 

TGGAGCTGCTGACAGTGCTGTGGGTGGTGCCTACCCT 

TTCCTCCTTACCCTCTACCTGGGCCT^ 

CCGGGCCGAGOTACAAAGAAAACTCCACCTC 

GAGGAGCTCACTCTGGTTACAAGCCCTGTTCT 

ATCCAGGCCCTGGCTGCTTCATGCCAGAGGCCCAAATCCATGGACTC 

CTACTACTTGAGACTTTATTCTCTGGGTCCAGCTCCATACCCTAAATTCTGAG 

CTGAACTCC^GGTCCACTTCTCAC^ 

TTCACTGTTTAGAGCATCACACT 

CTGACC^CTCCCCTGGCACTGTTAC^^ 

GGCTTCCACTCCAAGAAGGTGGACCAGGGTCTGC^ 

GGCCCCAACCTTGCCTC^CCACTCCC^C^ 

GGGCTCAGACCCCAACCTAGTCAAGGGGATTCTC^ 

CCTGCTCTCCCGAGGAAGATGCTCT^ 



FIGURE 135

MAPALLLI PAALASPI LAFGT6VEFVRFTSLRPLLGGI PESGGPDARQGWLAALQDRS I LAP 
IAWDI^LLLLFVGQHSmAAERVKAWTSRYFGVI^RSLWACTAI^QLVMRyWEP I PKGPV 
LWEARAEPWATWVPLLCFVLHVISWLL I FS I LLVFD YAELMGLKQVYYHVLGLGEPLALKS P 
RALRLFSHLRHPVCVELLTVLWVVPTLGTDRLLLAPLLTLYLGLAHGL^ 



KLHLLSRPQDGEAE 



FIGURE 136



CCGAGC^CAGGAGATTGCCTGCGTTTAGGAGGTGGCTGCGTTGTGGGAAAAGCTATCAAGGA 

AGAAATTGCCAAACCATGTCTTTTTTTCTGTTTTCAGAGTAGTTCACAACAGATCT 

TTTAATTAAGCATGGAATACAGAAAAC^CAAAAAACTTAAGCTTTAATTT(^TCT 

CCIACAGTTTTCTTAGCTCCCTGGACCCOGTTGACCTGTTGGCTCTTCCCGCTGGCTGCTCTA 

TC^CGTGGTGCTCTCCGAOTACTCACCCCGAGTGTAAAGAACCTTCGGCTCGCGTGCTTCTG 

AGCTGCTGTGGATGGCCTCGGCTCTCTGGACTGTCCTTCCGAGTAGGATGTCACTGAGATCC 

CTCAAATGGAGCCTCCTGCTGCTGTCACTCCTGAGTTTCTTTGTGATGTGGTACCr 

TCCCCACTACAATGTGATAGAACGCGTGAACTGGATGTACTTCTATGAGTATGAGCCGATTT 

ACAGAC^GACTTTCACrTCACACTTCGAGAGCATTCAAACrGCT 

CTGGTCATTCTGGTGACCTCCCACCCTTCAGATGTGAAAGCCAGGCAGGCCATTAGAGTTAC 
TTGGGGTGAAAAAAAGTCTTGGTGGGGATATGAGGTTCTTACATTTTTCTITATTAGGCC^AG 
AGGCTGAAAAGGAAGACAAAATGTTGGCATTGTCCTTAGAGGATGAACACCTTCTTTATGGT 
GACATAATCCGACIAAGATTTTTTAGACACATATAATAACCTGACCTTGAAAACCATTATGGC 
ATTCAGGTGGGTAACTGAGTTTTGCCCCAATGCCAAGTACGTAATGAAGACAGACACTGATG 
TTTTCATCAATACTGGGAATTTAGTGAAGTATCTTTTAAACCTAAACCACTCAGAGAAGTTT 
TTCACAGGTTATCCTCTAATTGATAATTATTCCTATAGAGGATTTTACCAAAAAACCCATAT 
TTCTTACCAGGAGTATCCTTTCAAGGTGTTCCCTCCATACTGCAGTGGGTTGGGTTATATAA 
TGTCCAGAGATTTGGTGCCAAGiGATCTATGAAATGATGGGTCACGTAAAACCCATCAAGTTT 
GAAGATGTTTATGTCGGGATCTGTTTGAATTTATTAAAAGTGAACATTCATATTCCAGAAGA 
CACAAATCTTTTCTTTCTATATAGAATCCATTTGGATGTCTGTCAACTGAGACGTGTGATTG 

O^CCATGGCTTTTCITCCAAGGAGATC^^ 
ACATGCCATTATl^CTTC^C^TTCTACAAAAAGCCTAGAA 

GTGTTAAATAAAGTAGGTACTGTGGAAAATTCATGGGGAGGTCAGTGTGCTGGCTTACACTG 
AACTGAAACTCATGAAAAACCCAGACTGGAGACTGGAGGGTTACACITGTGATTTATTAGTC 
AGGCCCTTCAAAGATGATATGTG G AGGAATTAAATATAAAGGAA TTGGAGGT TTITXSCTAAA 
GAAATTAATAGGACCAAACAATTTGGAGATGTCATTCTGTAGACTAGAATTTCTTAAAAGGG 
TGTTACTGAGTTATAAGCTCJ^CTAGGCTGTAAAAACAAAACAATGTAGAGTTTTATTTATTG 
AACAATGTAGTCACTTGAAGGTTTTGTGTATATCTTATGTGGATTACCAATTTAAAAATATA 
TGTAGTTCTGTGTCAAAAAACTTCTTCACTGAAGTTATACTGAACAAAATTTTACCTGTTTT 
TGGTCATTTATAAAGTACTTCAAGATGTTGCAGTATTTCACAGTTATTATTATTTAAAATTA 
CTTCAACTTTGTGTTTTTAAATGTTTTGACGATTTCAATACAAGATAAAAAGGATAGTGAAT 
CATTCTTTACATGCAAACATTTTCCAGTTACTTAACTGATCAGTTTATTATTGATACATCAC 
TCCATTAATGTAAAGTCATAGGTCATTATTGCATATCAGTAATCTCTTGGACTTTGTTAAAT 
ATTTTACTGTGGTAATATAGAGAAGAATTAAAGCAAGAAAATCTGAAAA 




FIGURE 137 

MASALWTVLPSRMSLRSLKWSLLLL^ ERVNWMYFYE YE PIYRQD 

FHFTLREHSNCSHQNPFLVILVTSHPSDVKARQAIRVTWGEKKSWWGYEVLTFFLL 
EDKMLALSLEDEHLLYGDIIRQDFLDTYimLTLKTIMAFRWVTEFCPNAK^ 
TGNLVKYLLNLNHSEKFFTGYPLI^ 

LVPRI YEMMGHVKP I KFEDVYVGI CLNLLKVNIHI PEDTNLFFLYRIHLDVCQLRRVI AAHG 
FSSKE I ITFWQVMLRNTTCHY 



FIGURE 138

CCTCTGTCCACTGCTTTCGTGAAGACAAGATCAAGTTCACAATTGTCTTTG 
GGAGTCTTTCTAGCTCCTGCCCTAGCTAACTA 

TGCTGGAAGTGGGCAGC^GTCAGTGAGTGTCAACAATGAACACAATGT^ 

ATAAC^CXX^TGGGACrCOTGGAATTCCATCTGGGATTATC 

AGACTCTTTCAAAAGAAGACATCK^TTC 

TCAATCCCTTGATGCACTGGTCAAGGAAAAGAAGCTTCA^ 

CTCCCAAGGGCCTGATGTACTC^GTCAArc 

AAAAACATTGCAAACATGTGTCGTGGGATTCCAACAT^ 

AAGCCTGTTTTTTTACTCAGGAACGTGCTACACGACCAGTGTACTATGGACT 

CCTTCTGTGGAGACACOGTGGAGAACTA^CAAT^ 

CTGAATATGCTGTGCAGAAAAAATATGGGCTCCAGTGGTTTTTACCA 

TTTCTCTACTAGTTATGTTTGATTTClTTAA 



FIGURE 139

MKFTIVFAGLLGVFLJUPAIiAire^ 

IWDYGNGFAATRLFQKKTCIVHKMNKEVMPSIQSLDALVKEKKLQGKGPGGPPPKGLMYSVN 
PNKVDDLSKFGKNIANMCRGIPTYMAEEMQEASLFFYSGTCJrrTSVLWIVDISFCGDTVEN 



FIGURE 140

CATTTCTGAAACTAA/TCGTGTC 
TTAACTTTTTAGGAGTAATTTCTA 

CTCATATATAGGAAAATCGCATATGGTCCTAGTATTAAATTCTTATTCCTT 

TGAGTTAAGAGTTGTTATATGCTAGAATATGAGGATGTGAATATAAATAAGAGAAGAAAAAA 

GAATAAAGTAGATTGAGTCTCCAATTTTATGTAAGCTO 

CAAGCTTATAGTTGAAATATTTTTCAGGAATTA CATGA ATC 

TTGTTCGATTTCAACCAGAGACTATAGC^TGTGCTTGCATCTACCTTGCAGCT 

CAGATTGCGTTGCCAACTCGTCCCCATTGGTTTC 

CCAGGAAATCTGCATAGAAACACTTAGGCTTTA^ 

TGGAAAAAGAAGTAGAAAAAAGAAAAGTAGCCTTACAAGAA^ 

TTGAATCCGGATGGAACTCCAGCCCTTTCAACCCT 

ATCATCACCAAGAGAAGTAAAAGCTGAAGAGAAATCACCAATCTGCATTAATGTGAAGACAG 

TCAAAAAAGAACCTGAGGATAGACAACAGGCTTCCAAAAGCCCT 

GACAGCAAGAGAAGTAGAAATAGCAGAAGTGCAAGTCGATCGAGGTC^ 

TTCTAGATCACATACTCCAAGAAGACACTATAATAATAGGCGGAGTCGATCT 

GCTCGAGATCAAGAAGCAGGTCCCGCAGTCACAGTGAAAGCCCT 

GGTTCTCCTCACCTTAAGGCCAAGCATACCAGAGATGATTTAAAAAGTTOV^ 

TCATAAAAGGAAAAAATCTCGTTCTCGATCTCA^^ 

CCAAGAAACACAGGCATGAAAGGGGACATCATA<^ 

GAGAGGTCCCATAAAAGCAAGCACCATGGTGGCAGTCGCTCAGGACA 

CTOftCTTTCTCTTCCTTTGAGCCTO 

ATGGACTCAATCAAAAACATTAAACGCAAACTGATTAGGATTTGATTTCTTC 

GGTCTCTAGAACACTGAGGACAGTTTCTTTTGAAAAGAACTATGTO 

AAAATGCCCTAGCAGTATCTAATTAAAAACCATGGTCAGGTTCAATTGTACT 

TGTGTATTGTTTATTGCTATAAGAACI^ 

ATACAGATAAAATTGCAGACACTGTTCTATTT^ 

ACTTTCTTAACACTGGTTTGTCTGCATGTGTAAAGATT 

CTTGTTTTTTCTAAAAAAAAAAAAAAAAAAGT 



FIGURE 141

^INDSLRTNVFVRFQPETIACACIYLAARALQIPLPTRPHWPLLFGTTEEEIQEICIETIJRLY 
TRKKPNYELLEKEVEKRKVALQEAKLKAKGIJJPDGTPALSTLGGFSPASKPSSPREVKAEEK 
SPISINVKTVKKEPEDRQQASKSPYNGVRKDSKRSRNSRSASRSRSRTRSRSRSHTPRRHYN 
NRRSRSGTYSSRSRSRSRSHSESPRRHHNHGSPHLKAKHTRDDLKSSNRHGHKRKKSRSRSQ 
SKSRDHSDAAKKHRHERGHHRDRRERSRSFERSHKSKHHGGSRSGHGRHRR 




FIGURE 142 

TGGGGATAAAGGAAAAATGGTCAGGTATTAATGGCT 

TTTTTTGAANNTATTCX3GGTCANAATTGNCTO 

TANCTTTTTAGAGTAATTTCTAGTTTGGA^ 

CATATATAGGAAAATCGCATATGGTCXn'AGTATTAAATTNTTATTGCT 

AGTTAAGAGTTGTTATATGNTAGAATATGAGGATGTGAATATAAATAAGAGAAGAAAAAAGA 

ATAAAGTAGATTGAGTCTCCAATTTTATCTAAGC 

AGCTTATAGTTGAAATATTTTTCAGGAATTACAT 

GTTCGATTTCAACCAGAGANTATAGCATGTGCTTGCATCT 

GATTCCGTTGCCAACraGTCCCCATT<X5TTTCTTCT 

AGGAAATNTGCATAGAAACACTTAGGCITT^ 

GAAAAAGAAGTAGAAAAAAGAAAAGTAGCCTTACAAGAAGCCNAATTAAAAGCAAAGGGATT 
GAATCCGGATGGAACTCCAGCC<nTTCAACCCTGGGTG^ 



FIGURE 143



GGCACGAGGCCTCGTGCCAAGCTTGGCACGAGGGTGCACCGCGTTCTCGCACGCGTC^SSGC 

GGTCCTCGGAGTACAGCTGGTC^TGACCCTGCTCACTGCCACCCTCATGCACAGGCTGGCGC 

CAC^CTGCTCCTTCGCGCGCTGGCTGCTCTGTAACGGCAGTTTGTTCCGATACAAGCACCCG 

TCTGAGGAGGAGCTTCGGGCCCTGGCGGGGAAGCCGAGGCCCAGAGGCAGGAAAGAGCGGTG 

GGCCAATGGCCTTAGTGAGGAGAAGCCACTGTCTGTGCCCCGAGATGCCCCGTTCCAGCTGG 

AGACCTGCCCCOTCACGACCGTGGATGCCCTGGTCCTGCGCTTCTTCCTGGAGTACCAGTGG 

TTTGTGGACTTTGCTGTGTACTCGGGCGGCGTGTACCTCTTCACAGAGGCCTACTACT 

GCTGGGACCAGCCAAGGAGACTAAC^TTGCTGTGTTCTGGTGCCTGCTCACGGTGACCTTCT 

CCATCAAGATGTTCCTGACAGTGACACGGCTGTACTTCAGCGCCGAGGAGGGGGGTGAGCGC 

TCTGTCTGCCTCACCTTTGCCTTCCTCTTCCTGCTGCTGGCCATGCTGGTGCAAGTGGTGCG 

GGAGGAGACCCTCGAGCTGGGCCTGGAGCCTGGTCTGGCCAGCATGACCCAGAACTTAGAGC 

CACTTCTGAAGAAGCAGGGCTGGGACTGGGCGCITCCTGTGGCCAAGCTGGCTATCCGCGTG 

GGACTX3GGAGTGGTGGGCTCTGTGCTGGGTGCCTTCCTCACCTTCCC^ 

CCAGACCCACCGGGACX3CACTGACCATGTCGGAGGACAGACCCATGCTGCAGTTCCTCCTGC 

ACACCAGCTTCCTGTCTCCCCTGTTCATCCTGTGGCTCTGGACAAAGCCCATTGCACGGGAC 

TTCCTGCACCAGCCGCCGTTTGGGGAGACGCGTTTCTCCCTGCTGTCCGATTCTGCCTTCXjA 

CTCTGGGCGCCTCTGGTTGCTGGTGGTGCTGTGC CTGCTG CGGCTGGCGGTGACCCGGCCCC 

ACCTGCAGGCCTACCTGTGCCTGGCCAAGGCCCGGGTGGAGCAGCTGCGAAGGGAGGCTGGC 

CGCATCGAAGCCCGTGAAATCCAGCAGAGGGTGGTCCGAGTCTACTGCTATGTGACCGTGGT 

GAGCTTGCAGTACCTGACGCCGCTCATCCTCACCCTCAACTGCACACTTCTGCTCAAGACGC 

TGGGAGGCTATTCCTGGGGCCTGGGCCCAGCTCCTCTACTATCCCCCGACCCATCCTCAGCC 

AGCGCTGCCCCCATCGGCTCTGGGGAGGACGAAGTCCAGCAGACTGCAGCGCGGATTGCCGG 

GGCCCTGGGTGGCCTGCTTACTCCCCTCTTCCTCCGTGGCGTCCTGGCCTACCTCATCTGGT 

GGACGGCTGCCTGCCAGCTGCTCGCCAGCCTTTTCGGCCTCTACTTCCACCAGCACTTGGCA 

GGOTCCTAGCTGCCTGCAGACCCTCCTGGGGCCCTGAGGTCTGTTCCTGGGGCAGCGGGACA 

CTAGCCTGCCCCCrCTGTTTGCGCCCCCGTGTCCCCAGCTGCAAGGTGGGGCCGGACTCCCC 

GGCGTTCCCTTCACCACAGTGCCTGACCCGCGGCCCCCCTTGGACGCCGAGTTTCTGCCTCA 

GAACTGTCTCTCCTGGGCCCAGCAGCATGAGGGTCCCGAGGCCATTGTCTCCGAAGCGTATG 

TGCCAGGTTTGAGTGGCGAGGGTGATGCTGGCTGCTCTTCTGAACAAATAAAGGAGCATGCC 



GATTTTTAA 



FIGURE 144

^VLGVQLVVTLLTATLMHRIAPHCSFARWLLCNGSLFRYKHPSEEELRALAGKPRPR<3RKE 
RWANGLSEEKPLSVPRDAPFQLETCPLTTVDALVLRFFLEYQWFVDFAVYSGGVYLFTEAYY 
YMLGPAKETNIAVFWCLLTVTFSIKMFLTVTRLYFSAEEGGERSVCLTFAFLFLLLAMLVQV 
VREETLEI^LEPGIJ^MTQNL^PLLKKQGWDWALPVAKIAIRVGIAWGSVI^AFLTFPGLR 
IAQTHRDALTMSEDRPMLQFLLHTSFLSPLFILWLWTKPIARDFLHQPPFGETRFSLLSDSA 

FDSGRLWLLVVLCLLRIAWRPHLQAYLCIAKARVEQLRR^ 

WSLQYLTPLILTLNCTLLLKTLGGYSWGLGPAPLLSPDPSSASAAPIGSGEDEVQQTAARI 
AGALGGLLT PLFLRGVLAYL I WWT AACQLLAS LFGLYFHQHLAGS 



FIGURE 145

CGTTNGCACGCGTCAATGGCGGTCCTCGGAGTACAGCTGGTGGTGACCCTGCTCACTGCCAC 

CCTCATGCACAGGCTGGCGCCACACTGCTCCTTCGCGCGCTGGCTGCTCTGTAACGGCAG^ 

TGTTCCGATAC^GCACCCGTNTTGAGGAGGAGCTTCGGGCCCTGGCGGGGAAGCCGAGGCC 

CAGAGGCAGGAAAGAGCGGTGGGCCAATGGCCTTAGTGAGGAGAAGCCACTOTCTGTGCCCC 

GAGATGCCCCGTTCCAGCTGGAGACCTGCCCCCTCACGACCGTGGATGCCCTGGTCCTGCGC 

TTCTTCCTGGAGTACCAGTGGTTTGTGGACTTTGCTGTGTACTCGGGCGGCGTGTACCTCTT 

CACAGAGGCCTACTACTACIATGCTGGGACCAGCCS^GGAGACTAACATTGCTGTGTTCTGGT 

GCCTGCTCACAGTGACCTTCTCCATCAAGATGTTCCTGACAGTGACACGGCT 

GCCGAGGAGGGGGGTGAGCGCTCTGTCTGCCTCACCriTTGCCTTCCTCTTCC 

CATGCTGGTGCAAGCG 



FIGURE 146

GGTTCCTACATCCTCTCATCTGA^ 

TTAACGTGGCTTAATCTGAAGGTTCTCAGTCAAATTCT 

ATGGCAAGGTTTGCTTAAAGGAGCTTGGCTOT 

CAGGGAGAATGCAGCACACTGCTCGGAGAATOAAGGCGCTTCTGTT 

CTCAGTCCTGCTAACnACATTGACAATGT^ 

TAAAGGTGCCTCCCACTACGGCCTGACCAAA 

CAGACGGCTGTGCGAGCCTCACAGCCACGGCT 

ATCTCCTTAATGACAGACGAGCCTGGCCTAGACAACCCTGCCTACGTC 

CGGGCAGCCAGCAATCAGCCCAGTGGACTCTGGCCGGAGCAAC 

TTGAGAGATCCACTATTAGAAGCAGATCATTTAAAAAAATAAATCGAGCTCT 

CGAAGGACAAAGAGCXjGGAGTGCAGTTGCCAACCATGC^ 

AAACACCACTGCCCCTGAAGTCTTTCCAAGGTTGTACCACCTGAT^ 

CGAGCATCAAGATCAATCGAGTAGATCCCAGTGA 

AGCGAAACCCCACTGGTCCATATGATTATCC^ 

AGACGGCCGGCTACTGCCAGGAGACATCATTCT^^ 

TCCCTCAC7UVCTACGCTGTGOGTCTCC3X30GGCAGCCCTC 

ATGCGTGAACAGAAGTTCCGCAGCAGGAACAATG^ 

AGATGACAGCTTTCATGTGATTCTCAAC^ 

TGGTGCGCT^GGTGGATGAGCCTGGGGTTTTCATCTTCAATGTGCT 

TATCGAGATGGTCAGCTTGAGGAGAATGACCGTGTGTTAGCCATCAATGG 

ATATGGCAGCCCT^GAAAGTGCGGCTCATCT 

TCX3TGTCCCGCCAGGTTCGG<^GCGGAGCCCTGACATCTTTCAGGAA 
AATGGCAGCTGGTCCCCAGGGCCAGGGGAGAGGAGCAAC^ 

AATTACTTGTCATGAGAAGGTGGTAAATATCCAAAAAGACCCCGGTGAATCTOTC^ 
CCGTCGCAGGGGGAGCATCACATAGAGAATGGGATTTC 

JZCCGGAGQAG.^^ 

TGGGGTCGAACTGACAGAGGTCAGCCGGAGTGAGGCA^ 
CCTCGATAGTACTCAAAGCTTTGGAAGTCAA^ 

CCAGCAGCCCTGGACTCCAACCACAACATGGCCCCACCCAGTGACTGGTC 
CATGTGGCTGGAATTACCACGGTGCTTGTA^ 
CAGCTGGAAGTCTGGGCTTCTGCATTGTA^ 
TTTTTGATCAAATCCATTGTTC 

TGATATTCTTCTTGCTGTCAATGGTAGAAGTACATCAGGAATGATACATGCT^ 

GACTGCTGAAAGAACTTAAAGGAAGAATTACTCTAACTATTGTTTCTT^ 

TTAT^AATCAATGATGGGTCAGAGGAAAACAGAAAAATC 

ACACTATATTTATCTTGTCAGTTTTTATATTTAAAGAAAGAATACATTC 

AAAAGTATGATCATCTAATGAAAGCCAGTTACACCTGAGAAAAT^ 

AAACTACTAGTTTTTTTTCAGTGTGGAGGATTTCTCATTACTCT 

TTTCTATTCAATAAAAAGCCCTAAAACAACTA 

CAAGCTGATTTAAATTTAAAATTTGGTATATGCTGAAGTCTGCCAAGOT 

TTTTTAATTTACAGCTAAAATATTTTTTAAAATGCATTGCTC 

ACAAGAATAAATATTTTTCAGAAGTTAAA 



FIGURE 147

MKALLLLVLPWLSPANYIDNVGNLHFLYSELCKGASHYGLTKDRKRRSQDGCPDGCASLTAT 
APSPEVSAAATISLMTDEPGLDNPAYVSSAEDGQPAISPVDSGRSNRTRARPFERSTIRSRS 
FKKINRALSVIjRRTKSGSAVANHADQGRENSENTTAPEVFPRLYHLIPDGEITSIKINRVDP 

SESLS IRLVGGSETPLVHI I IQHI YRDGVI ARDGRLLPGD I ILKVNGMDI SNVPHNYAVRLL 
RQPCQVLWLTVMREQKFRSRNNGQAPDAYRPRDDSFHVILNKSSPEEQLGIKLVRKVDEPGV 

FIFNVLDGGVAYRHGQLEENDRVLAINGHDLRYGSPESAAHLIQASERRVHLVVSRQVRQRS 

' PDIFQEAGWNSNGSWSPGPGERSNTPKPLHPTITCHEKWNIQKDPGESLGMTVAGGASHRE 

WDLPIYVI SVEPGGVI SRDGRI KTGD ILLNVDGVELTEVSRSEAVALLKRTS SS I VLKALEV 
KEYEPQEDCSSPAALDSNHNMAPPSDWSPSWVMWLELPRCLYNCKDIVLRRNTAGSLGFCIV 
GGYEEYNGNKPFFIKSIVEGTPAYNDGRIRCGDILLAVNGRSTSGMIHACLARLLKELKGRI 



TLTIVSWPGTFL 




FIGURE 148 

CCAAAGTGATCATTTGAAAAAGAGATATCCACATCT^ 

GCACAGGGCAGCriTACTTACTCCAGCACCTTCCTCTCCCAG 

TTGGGATACAATCTCATGGATACGAGGTTTTTAACA 

AATGTTCAGGAGACAGTGACAATTGATAATGAAAAAAATAC^ 

AGGATCATCCTCTTCTACCACAATTTTTGACTATAAACA 

TCTCCCGAAGAGCCTGCTTTATCCTGAAGAT 

CTCCAATGGTACATCTATGAGAAACAGGCTCT 

GGTGAAGTACAACCCT CTGGAGTCTCTGATCAAAGATO 

CCATTGAGAAACTCTGCAAACATATCCCTT^^ 

AATGTCGGTGCTGGAGGCTGTGCAAAGGCT<^ 

TGCAGACATTCATGTTTAGGATGATTAGCCCTCTTGTTTTATC^ 

TTGGTTTACACTCAAAAGTCAAATTAAATTCTTTCCCAATGC 

AGTCAGAAAATATAAATGCTGTATTTATA 



FIGURE 149

MKILVAFLVVLTIFGIQSHGYEVFNIISPSNNGGNVQETVTIDNEKNTAIVNIHAGSCSSTT 
IFDYKHGYIASRVLSRRACFILKMDHQNIPPLNNLQWYIYEKQAIJ3NMFSNKYTWVKYNPLE 
SLIKDVDWFLLGSPIEKLCKHIPLYTCGEVVEirniNVGAGGCAKAGLI^ 



FIGURE 150



GGC^CGAGCCAGGAACTAGGAGGTTCTCACTGCCCGAGC^MIAG^ 
ATGGGGCTCCCTGGGCTGTTCTGCTTGGCCGTGC^ 

GGAGGAAGAAATTACCCCTGTGGTCTCCATTGCCTACAAAGTCCTGGAAGTTTTCCCCAAAG 
GCCGCTGGGTGCTCATAACCTCCTGTGCACCCCAGCC^^ 

TGTGGAACCAAGAACATCAAGGTGGCCAAGAAGGTGGTGAAGACCX^^ 

CAACCTCAACGTCACACTCAAGTCCAGTCCAGACCTGCTCACCT^ 

CCACCTCAGGTGCCCATGTGGACAGTGCCAGGCTACAGATGCACTGGGAGCTGTGGTCCAAG 
CCAGTGTCTGAGCTGCGGGCCAACTTCACTCTGCAGGACAGAGG 

GATGATCTGCCAGGCGTCCrCGGGCAGCCCACCTATCACCAACAGCCTGATCGGGAA 

GGCAGGTCCACCTGCAGCAGAGACCATGCCACAGGCAGCCTGCC^ 

AGCCAGA(^TCGGACTGGTTCTGGTGCCAGGCriX3CA 

CCTCACAGTGGTGCCCCCAGGTGGTGACCAGAAGATGGAGGACTGGCAGGGTCCCCTCGAGA 
GCCCCATCCTTGCCTTGCCGCTCTAC^GGACCACCCGCCGTCTGAGTGAAGAGGAGTTTGGG 
GGGTTCAGGATAGGGAATGGGGAGGTCAGAGGACGCAAAGCAGCAGCCATGTAQAATGARrr 
GTCGAGAGAGCCAAGCACGGCAGAGGACTGCAGGCCATCAGC^ 

GTTCATGCAAAATGAGTGTGTTTTAGCTGCrCTTGCCACAAAAAAAAAAAAAAAAAAAAAAA 



FIGURE 151

MGLPGLFCIAVLAASSFSKAREEE ITPWS I AYKVLEWPKGRWVLITCCAPQPPPPITYSL 
CGTKNIKVAKKVVKTHEPASFNLNVTLKSSPDLLTYFCRASSTSGAHVDSARL^ 
PVSELRANFTLQDRGAGPRVEMICQASSGSPPITNSLIGKDGQVHWQRPCIHRQPANFSFLP 
SQTSDWFWCQAANNANVQHSMjTVVPPGGDQKMEDWQGPLESPILALPLYRSTRRLSEEEFG 

GPRIGNGEVRGRKAAAM 



FIGURE 152

GGTCCTT AATGG CAGCAGCGGCCGCTACCAAGA 

CTGTCCGGCTGGTCCCGGGCTGGGCGAGCCGACCCTGACTCTCTTTGCT 

CATCCCTAAGTTCAGACCTGGACCACGGTGGTC 

CTTTTCnTCACTATGACTGTGGCAACAAGAC^ 

CTAAATGTCACAACGGCCTGGAAAGCACAGAACCCAGTACTG 

TACAGAGCAACTGCGTGACATTCAGCTGGAGAATT^ 

AGGCAAGGATGTCTTGTGAGCAGAAAGCTGAAGGA<^ 

TTCGATGGGCAGATOTTCCTCCTCTTTGACTCAGAGAAGAGAATGTGGA 

TGGAGCCAGAAAGATGAAAGAAAAGTGGGAGAATGACAAGGTTGTG^ 

ACTTCTCAATGGGAGACTGTATAGGATGGCTTGAGGACTTCTTGAT 

CTGGAGCCAAGTGCAGGAGCACCACTCGCCAT^ 

AGCCACCACCCTCATCCTTTGCTGCCTCCTCATC^ 

TCTGAGGAGAGTCCTTTAGAGTGACAGGTTAAAGCT 

GTCTTGATCAAACTCGCCCTTCTGTC^ 

GGCCTCCAGCAGATCATGATGACATCATGGACCCAATAGCTCATTCACTG 

TTGCCAACAATTTTACCAGCAGTTATACCTAACATATTATGC^ 

TGATGGAATTCCTGCACTTAAAGTTCTGG 

TCTTTTTGTTTGGAAAATCAAGTACrTCTTTGAATGATGATCr 
GTCAGTAAAATAATCACGTTAGACTTCAGACCT 

AATTTTTAAATTATTTAATAAGAAAAAATTTATATTAATGATTGTTTCCT 
TGTTCrGTACTGATATTTAAATAAAGAGTTCTATTTCCCAAAAAAAAAAAAAAAAAA 



FIGURE 153

MAAAAATKI LLCLPLLLLLSGWSRAGRADPHSLCYD ITVI PKFRPGPRWCAVQGQVDEKTFL 
HYDCGNKTVTPVS PLGKKLNVT TAWKAQNP VLRE WD I LTEQLRD I QLENYTPKEPLTLQAR 
MSCEQKAEGHSSGSWQFSFDGQIFLLFDSEKRMWTTVH 
MGDCIGWLEDFLMG^STLEPSAGAPLAMSSGCT 



FIGURE 154

GGGAAAGCCATTT<XJAAAACCCATCTAT^ 
CCTTGGGCCTCACAATTTTCATTCroTTTTCT^ 

TTGATCCCAACCATAACATCGTGGAGGGTTTTAATTTTGGTGGTAGCCCT 
GTGTGGCTTTCTTTGCAGAGGATTCCACCT 
GAGAATTTGGATTCTACTCTAAAAGTCAATATAGGACTTGGCAA 
TCAACCTGGCCTCCCATAAACAGGACAGATTAT^ 

CAACGGAGGCTATGAAAGCCATGAACAGATTCCAAAAAGAAAACTCAAA 

CCACAGAACAGCATTTCTGGGCCAGGCTGT^^ 

ATTGCTTTTTTCCCCAAAATTAACACA^ 

CCTCTCTCCATTCAAGCATTCAAAGTATATTTTCAATGAATTAAACCT^ 

TTAGATAGGCTTATTCTGACTGTATGCTTTACCAATGAGAGAAAAAAATGCATTTC 

CATCCTTTTCAATAAACTGTATTCATTTTC 






FIGURE 155 



MEL I PT ITSWRVL I LWALTQFWCGFLCRGFHLQNHELWLL I KREFGF YS KSQ YRTWQKKLA 
EDSTWPPINRTDYSGIX3KNGFYINGGYESHEQIPKRKLKLGGQPTEQHFWARL 






FIGURE 156 

GTTCTCCTTTCCGAGCCAAAATCCCAG^ 

CTCTTGTGGCAGGTAACTGTGCACCACCACACCTGGAATC 

CCTCACGGCGCAAGTGTGGATTCTGTGTGCAGCCATCGOT 

AGAACTGCCCCTCCGTTTGCTCGTGCAGTA^ 

GGCCTCTCCGAGGTCCCGCAGGGTATTCCC^ 

CAACATCCAGATGATCCAGG<XX3ACACCTTCCGCC^ 

TGGGCAGGAACIXXATCmSCAGATTGAGGTGGGGG 

ACCCnKjGAGCTGTTCGACAACTGGCTGACAGTCATCCCTA 

(^GCTGCGGGAGCTCTGGCTTCGCAACAAC^ 

ACCGGGTGCCCTCCCTCATGCGCCTGGACTTGGGGGAGCrrCAAGAAGC^ 

GAGGGAGCTTTTGAGGGGCTGTTCAACCTCAAGTATCTGAACTTGGGCATGTC 

AGACATGCCCAATCTCACCCCCCTGGTGGGGCT^^ 

TCCCTGAGATCAGGCCTGGCTCCITCCATGGCCTGAGCTCCCT 

AACTCACAGGTCAGCCTGATTGAGCGGAATGCTTTTGACGGG 

CAACTTGGCCCACAATAACCTCTCTTCTTTGCCCCATGAC 

TGGTGGAGTTGCATCTACACCACAACCCTTGGAAC 

TGGTGGCTTCGAGAGTATATACCCACCAATTCCACCTGCTGTGGC^ 

GCACATGCGAGGCCGCTACCTC<3TGGAGGTGGACCAGGCCTCCTTCCAGTG 

TCATCATGGACGCACCTCGAGACCTCAACATTTCTGA 

CGGACTCCCCCTATGTCCTC<X3TGAAGTGGTTGCTGCCCAAT^ 

CTCCC^CCACCCAAGGATCTCTGTC 

TTTCAGACACTGGGGTGTACACATGCATGGTC 

GCCTACCTCAATGTGAGCACGGCTGAGCTTAACACCTCCAACT 

AACAGTGGAGACCACGGAGATCTCGCCTGAGGACACAACGCGAAAGTACAAGCCTGTTCCT 

CC&CGTCCACTGGTTACCAGCCGGCATAT^ 

CGTGTGCCCAAGCAGGTGGCAGTACXZCX3CGACAGACA 

GGATGAAGTGATGAAGACCACCAAGATCATCATTGG^ 

CTGCCGCCATGTTGATTGTCTTCTATAAAC 

AC^GCCGCCCGGACTGTTGAGATAATCCAGGT^ 

AGCAGCAACAGCAGCTCCXjTCC<X3TGTATCAGGTC 

ATGACCATATTAACTACAACACCTACAAACCAGCACATGGGG 

CTGGGGAACTCTCTGCACCCCACAGTC 

T A f?C AAGG ACAAGG TAQ^GGAAACTCAAAT ATGACTCCCCTCCCCCAAAAAACTTATAAAAT 

GCAATAGAATGCAGACAAAGACAGCAACTTTTGT^ 

TATGCTTATATATTAAGTCTATGGGCT^ 

CAAAAAGTCAAAACA 



FIGURE 157

MKLLWQVTVHHHTWNAILLPFVYLTJ^WILCAAIAAAASAGPQNCPSVCSCSNQFSKWCT 
RRGLSEVPQGIPSNTRYLNLMENNIQMIQADTFRHLHHLEVLQLGRNSIRQIEVGAPNGLAS 
LNTLELFDNWLTV I PS GAFE YLS KLRELWLRNNP I E S I PS YAFNRVPS LMRLDLGELKKLEY 
ISEGAFBGLFNLKYLNLGMCNIKDMPNLTPLVGLEELEMSGNHPPEiRPGSFHGLSSLKKLW 

VMNSQVSLIEFJIAFIXSLASLVEIjNIiAHNNLSSLPHDLFT^ 

LAWWLREYIPTNSTCCGRCmPMHMRGRYLVEVDQASPQCSAPPIMDAPRDLNISEGRMAEL 
KCRTPPMSSVKWLLPNGTVLSHASRHPRI SVLNDGTLNFSHVLLSDTGVYTCMVTNVAGNSN 
ASAYLNVSTAELNTSNYSFFTTVTVETTEISPEDTTRKYKPVPTTSTGYQPA 
TTRVPKQVAVPATDTTDKMQTSLDEVMKTTKI I IGCFVAVTLLAAAMLIVFYKLRKRHQQRS 
TVTAARTVEI IQVDEDI PAATSAAATAAPSGVSGEGAWLPT IHDH INYNTYKPAHGAHWTE 
NSLGNSLHPTVTTISEPYIIQTHTKDKVQETQI 



FIGURE 158

a^CTCGGGCACCAGCOXX^CAAGG 

TTCTTCAGCTCCTTCTCATCTCGTCCTTGCCAAGAGAGTACA 

CCTGGAGCAGAGTGGAATATCATGTGTCGGGAGTGCTGTGAATATGATCAGATTGAGTGCGT 

CTGCCCCGGAAAGAGGGAAGTCGTGGGTTATAC^ 

AGTGTGACTCCTGCCTGATCCACC 

AATGGCTCATGGGGGGGTACCTTGGATGACTTCTAT^ 

CCGAGCAGGCTGGTACGGAGGAGACTGC^ 

GTCAGATTTTGTTGGAAAGCTATCCCCTAAATGCTCACTGTGAATGG^ 

CCTGGGTTTGTCATCCAACTAAGATTTGTCA^^ 

GTATGACTATGTTGAGGTTOSTGAT^^ 

GTGGCAACGAGCGGCCAGCTCCTATCCAGAGCATAGGATCGTCACTC 

TCCGATGGCTCCAAGAATTTTGACGGTTT^ 

CTCATCCCCTTGTTTCCATGACGGCACGTCCGTCCTTGA^ 

CCTGCTTGGCAGGCTATACTGGGCAGCGCTGT^ 

GACCCTGGGGGCCCAGTGAATGGGTACCAGAAAAT^ 

ACGCCATGCTAAAATTGGCACCGTGGTGTCTT^ 

GCAATCAGAAAAGAACTTGCCAGCAGAATGC^ 

AAAGCCTGCCGAGAACCAAAGATTTCAGACCTGGTGAGAAGGAGAGTTCT 

TCAGTCAAGGGAGACACCATTACACCAGCT^ 

AGAGTGCCCCTACCAAGAAGCCAGCCCTTCCCTTTGG 

CIXSCATACCCAGCTCCAGTATGAGTGC^ 

GAGGACATGTCTGAGKSACTGKMAAGTGGAGTGGGC^ 

GGAAAATTGAGAACATCACTGCTCCAAAGACCCAAGGGTTGCGCTGGC 

ATCTACAGGAGGACCAGCGGGGTGCATGACGGCAG 

CTGCAGCX^TGCCCTGGTGAATGA<3CGCACTC 



gatgatga<x:gggatgagaagac<^tccagagccta<^gatttotgct 

caactatgaccccatcctgcttgatgctgacatcgccatcctgaagctcct 

<3tatcagcaccggagtccagcccatctgcctcgctgccagt^^ 

caggagtcccacatcactgtggctggctck^ 

caagaacgacacactgcgctctggggtggtc^ 

agcatgaggaccatggcatcccagtciagtc 

cccactgccccttctgatatctgcactc 

gggacga<3catctcctgagccacgctggcatctgatggg 

aaacatgcagccacaggctctccactc 

gaaagaaatatgaaatg^ccatgctcatgcactccttgagaagtgtttc 

TGTACGTGTGTCATTGCGTGAAGG&GTG 

CTGTOCCAGGGCTTCTGACTTCA 

TGGTAGGCTGATGCOXIGTCCAC^^ 

AAGTAAGTTTCTTCAAAGAAGACCATATACAA^ 

TCCCCAACTTTCAGTTATACGAATGCCATC^ 

GCCCCTTTTGAGGCTCTCAAGTTCTAGAGAGCTGCCTGTGGGAC^^ 

TGGGATGTGGTCCATGCCTTTGTGTAC^TGGC 

ATCTCTTGTACACATTTTAATAAAATAAGGGTTGG 

AAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAA 
AAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAA 



FIGURE 159

MELGCWTQLGLTFLQLLLISSLPREYTVINEACPGAEWNIMCRECCEYDQIECVCPGKR^ 

GYTIPCCRNEENECTISCLIHPGCTIFENCKSCRNGSWGGTLDDFYVKGFYCAECRAGVnfGGD 

CMRCGQVUIAPKGQILLESYPI^AHCEWTIHAKPGFVIQLRFVMLSLEFDYMCQYDYVEV^ 

GDNRIX^IIKRVCQNERPAPIQSIGSSLHVLFHSDGSKNFDGFHAIYEBITACSSSPCFHDG 

TCVLDKAGSYKCACLAGYTGQRCENLLEERNCSDPGGPVNGYQKITGGPGLINGRHAKIGTV 

VSFFCNNSYVLSGNEKRTCQQNGEWSGKQP I CI KACREPKI SDLVRRRVLPMQVQSRETPLH 
QLYSAAFSKQKLQSAPTKKPALPFGDLPMGYQHLHTQI^YECISPFYRRLGSSRRTCLRTGK 
WSGRAPSCIPICGKIENITAPKTQGI^RWPWQAAIYRRTSGVHDGSLHKGAWFLVCSGALVNE 
RTVWAAHCVTDLGKVTMIKTADLKVVIiGKFYRDDDRDEKTIQSLQISAI ILHPNYDPILLD 
MIAILKLIjDKARISTRVQPICXAASRDIjSTSFQESHITVAGWNVIiADVRSPGFKNDTLRSG 
WSVVDSLLCEEQHEDHGIPVSVTDNMFCASWEPTAPSDICTAETGGIAAVSFPGRASPEPR 
WHLMGLVSWSYDKTCSHRLSTAFTKVLPFKDWIERNMK 



FIGURE 160

ACCAG6CATTGTATCTTCAGTTGT<!ATCAA<3T 

AGCTTTCTTGCCTGCAGTGAAGCAGAGAGATAGAT^ 

TTCAACCTGACTTTCCACCTTTCCTACAAATTCCGATTACTGra 

GACAGTGGTTGGGTGK3GCCACCAGTAACT^ 

CAAAGGAGTTCATGGCTAATTTCCATAAGACCCTCAI^ 

AATGAAGCATCCACGAAGAAGGTAGAACITC 

AGGCCAGAGCAAGCTCATTTTCAAACCAGATCT^ 

CCAAAGTGTCCAGAGGCCGGTATCX^^ 

GTTCCCCACCGGAACAGAGAGAAACACCTGATGTACCT 

GCAGAGGCAGCAGCTGGATTATGGCATCTACGTCATC 

ATCGAGCOWIACTCTTGAATGTGGGCTATCTAGAAGCCCT 

TTTATATTCCACGATGTGGACCTGGTACOCGAGAATGACTTO 

GCATCCCAAGCATCTGGTGGTTGGCAGGA^ 

ATTTTGGGGGTGTTACIHTCCCTAAGCAGAGXGCAGTT 

AACTACTOGGGATGGGGAGGCGAAGACGATGA^^ 

GAAAATTTCCCGGCCCCIXMCTGAAGTGGGTAAATATA 

AAGGCAATGAGGTGAAC<3CAGAACGGATGAAGCTCTTACACCA^ 

ACAGATGGGTTGAGTAGTTGTTCTTATAAAOT^ 

CAACATCA<^GTGGATTTCTGGTTTGGTGCAT^ 

AACTGATTCTTTGTTTCCAA 

ACCTGTTACAGCTCATTGTTGAGCT^ 

GGTGATGTAGAGTATAAAACAGTTGTAACAAGACAGCTTTCTTAGTCATTO 

GTTAAATATTGTAATATGGATACTTGAAGGACTTTATATAAAAGG^ 

ATGAACGCTATTTGAGGACTCTGGTTGAAGGAGATTTATTTAAATTTGA 

GGGATAAAAGGCCACAGGAAATAAGACTGCTGAATGTCT^^ 

CGAAGGTAGAAAGGTACGAAGATACAATACTC 

AAGTGGTGGTGTCAGGTGAGAAGGCGTCCACAAAA^ 

GACAGTGAACTTGGGAATGAAGAGGTAGCAGGAGGGTGGAGTGTCGGCTGC^ 

TAGCTGAGCTKMTTGCAGGTGCTGA 

AGTGATGCCCACCAGAGAATACATTCTCT^ 

TGTACAAGTAGGATATGAAITAGCAGTTT^ 

CTATCAAATACCTCTGTAGTAAAATGTGAAAAAGCAAAA

FIGURE 161

MGFNLTFHLSYKFRLLLLLTL

LTNEASTKKVELDNCPSVSP^RGQSKLIFKPDLTLEEVQAENPKVSRGRYRPQ 
AILVPHRNREKHLMYLLEHLHPFLQRQQLDYGIY^ 

WDCFIFHDVDLVPENDFNLYKCEEHPKHLWGRNSTGYRLRYSGYFGGWALSREQFF 

FSNNYWGWGGEDDDLRLRVELQRMKISRPLPEVGKYTMVFHTRDKGNEW 

VWRTDGLSSCSYKLVSVEHNPLYINITVDFWFGA 

Important features:
Signal peptide:
amino acids 1-27

N-glycosylation sites:
amino acids 4-7, 220-223 and 335-338

Xylose isomerase proteins:
amino acids 191-201
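
The N-glycosylation site spans listed above are each four residues long, which is consistent with the common N-{P}-[ST]-{P} sequon (PROSITE PS00001). The following is a minimal sketch of how such sites could be located, assuming that pattern; the listing does not state which tool or pattern produced the annotation, and the function name `n_glycosylation_sites` is hypothetical. It is run only on the first 21 residues of the Figure 161 sequence as printed.

```python
import re

# Assumed pattern (PROSITE PS00001): N, any residue except P, S or T,
# any residue except P. The lookahead lets overlapping sites be reported.
SEQUON = re.compile(r"(?=N[^P][ST][^P])")


def n_glycosylation_sites(protein):
    """Return 1-based, inclusive (start, end) spans of candidate sites."""
    return [(m.start() + 1, m.start() + 4) for m in SEQUON.finditer(protein)]


# First 21 residues of the Figure 161 sequence as printed in this listing.
fragment = "MGFNLTFHLSYKFRLLLLLTL"
print(n_glycosylation_sites(fragment))  # [(4, 7)]
```

The single hit at residues 4-7 (NLTF) agrees with the first annotated site. The later sites (220-223 and 335-338) are not checked here, because the sketch is not run on the full printed sequence.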
FIGURE 162

CGTGCMCCGGGGTCGOSCA^ 

GAGCTCCAGCTGCATTCCCTCCGCGTCCGCCCCACGCTTCT 

GCCCAGGCAGTGTGGTCGCGCCTCGGCOSC^ 

CCCGGCAGGGGTGGCCGCAGGCCTGTATGAACTCAATCTCA 

cgggagcggtggtgaccatctcggcc^ 
cccgctgacgcccacctctacx:gc^ 

GATGGAGAAGGGTCTCAGCTCX!ACCATCCGT^ 

TCTCTGTCTGGGTC^CTGCCGCT^ 

GTCCTCCCCATCACAGAGTTCCTCGTGGGGGACCT 

CTGGCCCAGCTCCTATCTCACTAAGACCGTCCTGAAAGTCTCCTTCCTCCT 

GCAACTTCCTCAAGACCGCCTTGTTTCTCT 

GTGACTGAAGACTCCGTGGTCTATTATAACT^ 

CAAAGTGGTGGCGGAGTGGGAAGAGGTGGAGCCGGATGCCACGAGGGCT 

CCGGGGACTTCTCCGCCTCGCTGAAGCT^ 

CCCACCCTAATTCAGACCTTCCAAAAG^ 

TCTGACTGTGTGCTGGCGTCTCAAGCCTGAG 

CTGTGTCCGTGGCCAGCACAGCGTACAACCTGACCCACACCTTCA 

TGCTTCAGCATCraKMCCGAGAATATCATCAGCAAGACACAT^ 

GTGGCCCTCGAGAATCCAGCCGGCTGTCT^ 

TGTTGGCCTTCATCATGTACATGACCCTGCG^ 

AACCCGGAGCCACCCTCTGGGGTCAGGTGCTC 

GGAGACTCCATCTGAGTACCTGGAAATTGTTCGTGAGAACCACGGGCTGCT 
ATAAGTCTGTCAAAACTTACACCXSTG^ 

CTGACTGCTGACTTGGAGTTTCCAGCAGGGTGGTGTGCACCAOT 
TGCGTGGGGCTGTTGGCCTGGATCATCCATCCAT 
CCTCCCTCTCTGTCACCCCTGAC^^ 
TAAGCCCCACTCGGTTACCACCCCCTTGAC^ 
TTGATGCTTGGGGTGTTCCGTGTTGACTC 
l^CATATTGGCACA 

CTGTGCCAGAGAGCTAGAAAGAAGGTCATAA 

ACATAGATGGGCACACTCACAGAGAGAAGTGTGCATGT^ 

CACACACACACAGAAATATAAACACATGC^ 

TCTGGTTAAGTCGGTTGCTGGGATGCACCCTG^ 

AGCAGCCCTGACAGGTTCTGGGCCCGGGCCCTCCCTTTGTGCTTTC 

GCCCTTTATAAGGCCATCCTAGTCCCTGCT 

AATACTGAGTGATTGCAGAGTGCTTTATAAATATCACCTTATTTTATCGAAAC 

AAACTTTCACTGAGGAAAAGGCCTTGCAGCGGTAGAAGAGGTTGAGTCAA 

TGGCTCACGCCTGTAATCCCAGCACTTTGGGAGGCCGAGGGGGGTC 

GATCGAGACCACCCTGGCTAACACGGTGAAACCCCKjTCTCTA 

AGCCGGGCGT^TGGTGGGTGCCTGTAGTCC^ 

GTGCGAACCCGGGAGGCXSGAGCTTGCAGTGAGC^ 

GTGACAGAGCX3AGACTCTGTCTCCA 



FIGURE 163

MAQAWSRI/SRILWIiACLLPWAPAGVAAGLYELNLTTDSPATTGAVVTISASLVAKDNGSLA 
LPADAHLYRFHWIHTPLVLTGKMEKGI^STIRWGHVPGEFPVSVWVTAADCWMCQPVARGF 
VVLPITEFLVGDLVVTQNTSLPWPSSYLTKTVLKVSFLLHDPSNFLKTALFLYSWDFGDGTQ 
MVTEDSVVYYNYSIIGTFTVKLKWAEWEEVEPDATRAVKQKTGDFSASLKLQETLRGIQ^ 
GPTLIQTFQKMTVTl^FLGSPPLTVCWRLKPECLPLEEGECHPVSVASTAYNLTHTFRDPGD 
YCFS IRAENI ISKTHQYHKIQWPSRIQPAWAFPCATLITVMLAFIMYMTLRNATQQKDMV 
ENPEPPSGVRCCCQMCCGPFLLETPSEYLE I VRENHGLLPPLYKSVKTYTV 

Important features of the protein:
Signal peptide:
amino acids 1-24

Transmembrane domain:
amino acids 339-362

N-glycosylation sites:
amino acids 34-37, 58-61, 142-145, 197-200, 300-303 and 364-367
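
The feature coordinates above read as 1-based, inclusive residue ranges. That reading is an assumption, but it matches the four-residue N-glycosylation spans. The sketch below shows one way to hold such annotations and convert them to zero-based string slices. The `Feature` class and its method names are hypothetical, and the slicing is demonstrated on a made-up toy string rather than on the printed Figure 163 sequence.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class Feature:
    name: str
    start: int  # 1-based, inclusive
    end: int    # 1-based, inclusive

    def length(self):
        # Inclusive range, so add one.
        return self.end - self.start + 1

    def extract(self, protein):
        # Convert 1-based inclusive coordinates to a Python slice.
        return protein[self.start - 1:self.end]


# Coordinates copied from the feature list for Figure 163.
FIGURE_163_FEATURES = [
    Feature("signal peptide", 1, 24),
    Feature("transmembrane domain", 339, 362),
] + [
    Feature("N-glycosylation site", s, s + 3)
    for s in (34, 58, 142, 197, 300, 364)
]

for feature in FIGURE_163_FEATURES:
    print(feature.name, feature.start, feature.end, feature.length())

# Toy example of slicing; "ABCDEF" is not a real sequence.
print(Feature("toy", 2, 4).extract("ABCDEF"))  # BCD
```

Under these coordinates the signal peptide and the transmembrane domain are each 24 residues long.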
FIGURE 164

GCTCAAGACCCAGCAGTGGGACAGCCAGACAGACGGCACG^T^CACTGAGCTCCCAGATCT

GGGCCGCTTGCCTCCTGCTCCTCCTCCTCCTCSCCAGCCTGACCAGTGGCTCTG 

Cyy^C^GACX3GGACAACTTGCAGAGCTGCAACCCCAGGACAGAGCTGG^ 

GATGCCC^TGTTCCAGAGGCGAAGGAGGCGAGA<^CCCACTTC^ 

GCGGCTGCrroTCATCGATCAAAGTGTGGGATGTGCTGCAAGACGTAQAACOTACCTC 

CCCCCGTCCCCTCCCTTCCITATTTATTCCT 

TGGCTGGTTCi M lTl^TTTTCCAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAA 
AAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAA 



FIGURE 165

MALSSQIWAACLLLLLLLASLTSGSVFPQQTGQIiAEIjQ^ 
FPICIFCCGCCHRSKCGMCCKT

FIGURE 166

CTGTGAGGAAGGACCATCTGAAGGCTGCAATTTGTTCTTAG
CTGGATCTTCCACC ATGT TCCTG^ 

ATCTCCCTGACTGTCCTCITCACCCTCCTTCTCGTTTTCATCATAGTGC 

AGTCTCCTTTGGTATCCGCAAACTCTACATGAAAAGTCTG 

CCITGAGAATGGAGCGAGGAGCCAAGGAGAAGAACC^CCAGCTT^ 

GGAATCATTGCAAAGGATCCCACTTCACTAGAAGAAGAGATCAAAGAGATTCGTCGAA^ 

TAGTAGTAAGGCTCTGGACAACACTCCAGAGTTCGAGCTCTCTGACATTTTCT 

GGAAAGGAATGGAGACGATTATGGATGATGAGGTGACAAAGAGA 

GAGTCCTGGAACCTGCTGAGCAGAACCAATTATAACTTCCAGTACATCAGCCT^ 

GGTCCTGTGGGGGTTAGGAGTGCTGATTCGGTACTGCTTC 

TGGCTTTCACAGGGATTAGCCTTCTGGTGGTGGGCACAACT 

GGGAGGTTTAAGGAATTCATGAGTAAACATGTTCACTTAATGTGTT 

AGCX3CTGACAGCCATCATCACCTAGCATGACAGG 

GTGTGGCCAATCATACCTCACCGATCGATGTGATCAT^ 

ATGGTGGGTCAAGTGCACGGGGGACTCATGGGTGTGATTCAGAGA 

CCCACACGTCTGGTTTGAGCXjCTCGGAAGTGAAGGATC 

CTGAACATGTGCAAGATAAAAGCAAGCTGCCTATCCTC 

AATAATAGATCGGTGATGATGTTCAAAAAGGGAAGTTTTGT^^ 

TGTTGCTATCAAGTATGACCCTCAATTTGGCGA 

TGGTGACGTACCTGCT<3CGAATGATGACCACX!TGGGCCATTGTCTG 

CCTCCCATGACTAGAGAGGC^GATGAAGATGCTGTCaVGTTTGCGAATAGGGTGAAA 

CATTCCCAGGCAGGGAGGACTTGTGGACCTGCTG^^ 

TGAAGGACACGTTCAAGGAGGAGCAGCAGAAGCTOT^ 

AAGGACAGGAGCCGCTCCTGAGCCTGC<irr<2!CA<5CTGG CTCXXIGCC A rrvyrVOTVyreTYyv^ft ft 
CX^OTCAGAGCTCGAGTTGCO^CGG<XX:G€CCACTGC^ 
— GTGGGGGGGG-TX3CTGTGGATGGGAGGACTGCGGCTTTCGCCGAGCGGCA 

gcacck:ggcggagcctacgcttggtggtgtaaagggatgctggtgggtc 

CGAGATGCCTTGTTTCTTTTACAATAAGTCGTTGG 

CCTTTGCACGCTGTGCGGGCTGAGTGGTTGGGGAGATGTGGCCATGGTCTTGTGCT 

GGCGGTACAAGAGTCTGTTATGCAAGCCCGTGTGCCA^ 

CTCTCCAGGAAAGGCACAGCTGAGGCACTGTGGCTGGCTTCGGCCrcAACA 

CTTGGAGCTCTGCAGACATGATAGGAAGGAAACTGTCATCrGCAGGGGCT 

AAGGGTTAGATTTTTATGCTGCTGCTGATGGGGTTACTAAAGG^ 

GGCCGCTGACTGGGCCATGGGGAGAACGTGTGTTCGTAGTCCAGGCTAACCCTGAACTC 

ATGTGATGCGCGCTTTGTTGAATGTGTGTCTCra 

GAATGGTGGTGATTCCTACCTCAGAGGGCrGTTCT<XXX3A 

AGGACACATCACGTTCAGTGTTTGAAGTACAGGCC^ 

CTCAGAGCTGCTGCAGTGGGCTITGG 
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FIGURE 167 

MFLLLPFDSLIVNLLGISLTVLFTLLLVFI IVPAIPGVSFGIRKLYMKSLLKIFAWATLRME 
RGAKEKNHQLYKPYTNGIIAKDPTSLEEEIKEIRRSGSSKALDNTPEFELSDIFYFCRKGME 
TIMDDEVTKRFSAEELESWNLLSRTNYNFQYISLRLTVLWGLGVLIRYCFI1LPLRIALAFT6 

ISLLWGTTWGYLPNGRFKEFMSKHVHLMCYRI CVRALTAI I TYHDRENRPRNGG I CVANH 
TSPIDVIILiASDGYYAMVGQWGGIJ^GVIQRAMVKACPHWFERSEVKDRHLVAKRLTEHVQ 
DKSKLPILIFPEGTCINNTSVMMFKKGSFEIGATVYPVAIKYDPQFGDAFVWSSKYGMVTYL 
LRMMTSWAIVCSVWYLPPMTREADEDAVQFANRVKSAIT^O^LVDLiWIXMLKREKVKDTF 

KEEQQKLYSKMIVGNHKDRSRS 
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FIGURE 168 



GCCCCTCGAAACCAGGACTCCAGCAC 

CGTCTCCTCCAGGGATTOCGCTGGCGGCTTTGATGATCGCCCTC 

ACCTGGCAGGCCCAGGCTGTTCCCACCATCCTGCXICCTC 

O^TACCTATGTGGGTTGTGCAGAGGAGATGGAGGAGAAGGCAGCCCCC 

AAATGGCCCACC^TGCCCTGCTGCGGGAATCCTGGGAGGCAGCCCAGGAG 

AAGCGTGGAGGGCTTACCTTGCCCCCTGGCTTCAAAGCCCAGA^ 

CTACACCAACTCATCGAACACCTTGT^ 

GCTCCCGGGAGCTCTACATCAGGCACTTTCCCTTCAAGGCCCTGCATT^ 
GCCCTGCAGCTGCTGCGAGGCAGTGGGGGCTGCAGCAGGGGACCT 
AGGTGTGGGCAGCCTTOGCTTTGAACCC^ 
TTGCCTCCAGCTCCCTGGATAAGGCAGTG^ 

GTGTCTGCGCCAGGGGTGCAGCTAGGGTCACAATCTGAGGGGGCCTCCT 

GAAGACTCTGCTCTTGGCCCCTGGAGA 

ACATCT<XZCACTTAGGAGCCCTGGGAAC^ 

CAGCCTTGAGAAGCAAGAACATGGTTCCGGACCCAGCCCTAGCA 
ATGTTGGCCTGGGGAGGCCACAGCAGGGCTGAGGGAACTCTGCT 
GGGACAAGCAAGGAAAGTACTGAGGCAGCCACTTGATTGAAOK5TGTTGCAATC 



TGGAGTTTTATTGAGGTAGCTACGTGATTAAATGGTATTGCAGTGTGGA 



FIGURE 169 

MAIJ^ALMIALGSLGLHTWQAQAVPTILP 

ALIJIESWEAAQETWEDKRRGLTLPPGFKAQNGIAIMVYTNSSNTLYT^LNQAVRTC 
YMRHFPFKALHFYLIRALQLLRGSGGCSRGPGEVVFRGVGSLRFEPKRLGDSVRLGQFASSS 

LDKAVAHRFGEKRRGCVS APGVQLGSQSEGAS S LP P WKTLLLAPGEFQLSGVG P 
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FIGURE 170 

GTGGCTTCATTTCAGTGGCT 

CCCTCATCTATATCCTTTGGCAGCTCACA^ 

GTCGGTTCCGTTGGTGGGGCCGTGACTTTC^ 

TATTGTCTGGACCITCAACACAACCC^ 

TAGTGACCX]AAAATCGTAATAGGGAGAGAGTAGACTTC 

CTCAGCAAACTGAAGAAGAATGACTCAGGGATCTACTATG 

CCAGCAGCCCTCCACCCAGGAGTAC^ 

TCACCATGGGTCTGCAGAGCAATAAGAATGGCACCTGTGTGACCAATCT 

GAACATGGGGAAGAGGATGTGATTTATACCTGGAAGGCCCTGGGGCJ^ 

CCATAATGGGTCCATCCTCCCCATCTCCTGGAGATGGGGAGAAAGTGATA 

GCGTTGCCAGGAACCCTGTGAGCAGAAACT^ 

GAAGGTGCTGCTGATGACCCAGATTCCTCCATGGTCCT 

CCTGCTC^GTCTCTTTGTACTGGGGCTATTTCTT 

AGTACATTGAAGAGAAGAAGAGAGTGK5ACATTTGTCGG 

TCTX5GAGAGAACACAGAGTACGACACAATCCCTCACACTAATAGAACAA 

TCCAGCAAATACGGTTTACTCCACTGTGGAAATAC^ 

TGCTCACGATGCCAGACACACCAAGGCTATTTGCCTATGAGAATC 
CACTCCCCTAAGTCTCTGCTCA 



FIGURE 171 



MAGSPTCLTLIYILWQLTGSAASGPVKELVGSVGGAVTFPLKSKVKQVDSIVWTFNTTPLVT 
IQPEGGTIIVTQNRNRERVDFPIX3GYSLKLSKIiKKNDSGIYWGIYSSSLQQPSTQEYVLHV 

YEHI^KPKVTMGLQSNKNGTCVTNLTCCMFJIGEEDVIYTWKALGQAA^ 
GESDMTFICVARNPVSRNFSSPILARKLCEGAADDPDSSMVLLCLLLVPLLLSLFVLGLFLW 

FLKRERQEEY IEEKKRVD ICRETPNI CPHSGENTEYDTI PHTNRTILKEDPANTVYSTVEI P 
KKMENPHSLLTMPDTPRLFAYENVI 
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FIGURE 172 

CTGGTTCCCCAACAT^ 

TCTGGACCCGTGAA&GAGCTGGTCGGTT^ 

CAAAGTAAAGCAAGTTGACTCTATTGTCTGGACC 

AGCCAGAAGGGGGCACTATCATAGTGACCCAAAATCGT^^ 

GATGGAGGCTACTCCCTGAAGCTCAGCAAA 

GGGGATATACAGCTCATCACTCCAGCAGCCCT 

AGCACCTGTCAAAGCCTAAAGTCACCAT^ 

ACCAATCraACATGCTGCATGGAACATGaSGAAGA 

GGGGCAAGCAGCCAATGAGTCCCATAATGGGTC 

AAAGTGATATGACCTTGATCTCC^ 

ATCCTTGCCAGGAAGCTCTCTGAAGGTGCTGCTGATGACCCAGATTC 
GTGTCTCCTGTTGGTGCCCCTCCTGCTCAGTCT 

TGAAGAGAGAGAGACAAGAAGAGTACATTGAAGAGAAGAAGAGAGTGGACATT^ 
ACTCCTAACATATGCCCXZCATTCTGGAGAGAACA<^ 

TAGAACAATCCTAAAGGAAGATCCAGCAAATACGGTTTACTCCACTGTGGAAATA 

AGATGGAAAATCCCCACTCACTGCTCTICGATC^ 

AATGTTATCTAGACAGCAGTGCACTCCCCTAAGTCT 
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FIGURE 173 



GAAAGACGTGGTCCTGAC^C^CAGACAATCCTATTCCCTACCAAA^AAGATGCTGCTGCT 

gctgtgtttgggactgaccctagtctx;tgtccatgcagaagaagctagttctacgggaagga 

ACTTTAATGTAGAAAAGATTAATGGGGAATGGCATACTATTATCCTGGCCTCTGACAAAAQA 

GAAAAGATAGAAGAACATGGCAACTITTAGACrrTTTTCTGGAGCAAATC^ 
TTCCTTAGTTCTTAAAGTCCATACTGTAAGAGATGAAGAGT6CTCCGAATTATCTATGGTTG 
CTGACAAAACAGAAAAGGCTGGTGAATATTCTGTGACGTATGATGGATTCAATACATTTAOT 
ATACCTAAGACAGACTATGATAACTTTCTTATGGCTCACCTCATTAACGAAAAGGATGGGGA 

AACCTTCCAGCTGATGGGGCTCTATGGCCGAGAACCAGATTTGAGTTCA 
GGTTTGCACAACTATGTGAGGAGCATGGAATCCTTAGAGAAAATATCATTGACCTATCCyVAT 

GCCAATCGCTGCCTCGAGGCCCGAGAATGAAGAATGGCCTGAGCCTCCAGTGTTGAGTGGAC 
ACTTCTCACCAGGACTCC^CCATC^TCCCTTCCTATCCATACAGCATCCCCAGTATAAATTC 
TGTGATCrGCATTCCATCCTGTCTCACTGAGAAGTCCAATTCCAGTCTATCAACATGTTACC 
TAGGATACCTCATCAAGAATCAAAGACTTCTTTAAATTTCTCTTTGATACACCCTTGACAAT 
TTTTCATGAAATTATTCCTCTTCCTGTTCAATAAATGATTACCCTTGC^CTTAA 
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FIGURE 174 

MKMLLLLCLGLTLVCVHAEEASSTGRNFNVEK I NGEWHT I ILASDKRBKI EEHGNFRLFLBQ 
IHVLENSLVLKVHTVRDEECSELSMVADKTEKAGEYSVTYIX3FNTFTIPKTDYDNFLMAHLI 
NEKDGETFQLMGLYGREPDLS SD I KERFAQLCEEHG I LRENI I DLSNANRCLQARE 



FIGURE 175 

GGCTCGAGCGTTTCTGAGCCAGGGGTGACC^^CCTGCTGCGAAGGATGGACATCCTGCAA 
TGGAITCAGCCTGCTGGTTCTACTGCTGTTAGGAGTAGTTCTCAATGCGATACCTCTAATTG 

TCAGCTTAGTTGAGGAAC^CCAATTTTCTCS^ 

CCAGGAATTATAGGAGCAGGTCTGATGGCCATTCCAGCAACAACAATGTCCTTC 
AAAAACSAGCGTGCTGCAACAACAGAACTGGAATGTTTCm 

CAGTCATTCX3TGCTCTGTATTGCATGCTGATATCCATCCAGGCTCTCTTAAAAGGTCCT 

ATGTGTAATTCTCCAAGCAACAGTAATGCCAATTGTGAATTTTCATTGAAAAACATC^ 

CATTCATCCAGAATCCTTC^CTTGCAGTGGTTTT^ 

GTTTCAATAAACCCACCAGTAACGACACCATGGCGAGTGGCTGGAGAGCATCTAGTTTCC^C 
TTCGATTCTGAAGAAAACAAACATAGGCTTATCGACTTCTCAGTATTTTTAGGTCTATTGCT. 
TGTTGGAATTCTGGAGGTCCTGTTTGGGCTCAGTCAGATAGTCATCGGTTTCCTTC 
TGTGTGGAGTCTCTAAGCGAAGAAGTCAAATTGTGTAGTTTAATGGGAATAAAATGTAAGTA 



TCAGTAGTTTGAAAAAAAAAAA 
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FIGURE 176 

MTCCEGWTSCTGFSLLVLLLLGVVI^ 
IPATTMSLTARKRACCNNRTGM 
NCEFSLKNISDIHPESFOTjQWFETO)SCAPPT 
IHFSVFLGLLLVGILEVLFGLSQIVI<3FIX5CL€GVSKRRSQIV 
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FIGURE 177 



GTCGAATCCAAATCACTCATTGTGAAAGCTGAGCTCACAGCCGAATAAGCCACCAISAGGCT 
GTCAGTGTGTCTCCTGATGGTCTCGCTG6CCCTTTGCTGCTACCAGGCCCATGCTCTTGTCT 

GCCC^GCrTGTTGCTTCTGAGATCACAGTCrrTCTTATTCTTAAGTGACG 

CAAGTTGCCAAACTTAATCCACCTCCy^AAGCTCTTGCAGCCAAGTTGGAAGTGAAGCACTG 

CACCGATCAGATATCTTTTAAGAAACGACTCTC^TTGAAAAAGTCCTGGTGGAAAS&GTGAA 

AAAATGTGGTGTGTGACATGTAAAAATGCTCAACCTGGTTTC 
CTGATCTTCACTAAAAATTGTAAAGGTTTCAACACGTTGCT^ 
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FIGURE 178 



MRLSVCLLMVSLMjCCYQAHALVCPAVASEITVFLFL^ 
KHCTDQI SFKKRLSLKKSWWK 



FIGURE 179 

ATCCGTTCTCTGCGCTGCCAGCTCAGGTGAGCCCTCGCCAAGGTGACCTCGCAGGACACTGG 
TGAAGGAGCAGTGAGGAACCTGC^GAGTCAC^CAGTTGCTGACCAATTGAGCTGTGAGCCTG 
GAGCAGATCCGTGGGCTGCAGACCCCCGCCCCAGTGCCTCrCCCCCTGCAGCCCTGCCCCTC 
GAACTGTGAC^ESGAGAGAGTGACCCTGGCCCTTCTCCTACTGGCAGGCCTGACTGCCTTGG 
AAGCCAATGACCCATTTGCCAATAAAGACGATCCCTTCTACT^ 

CTGAGCGGACTGATCTGCGGAGGGCTCCTGGCCATTGCTGGGATCGCGGCAGTTCTGAGTGG 
CAAATGCAAATAC^GAGCAGCCAGAAGCAGCACAGTCCTGTACCrGAGAAGGCCATCCCAC 

TC^TCACTCCAGGCTCTGCCACTACTTCGCTC 

AGCCTAACACTGGCCCCCAGCACCTCCTCCCCTGGGAGGCCTTATCCTCAAGGAAGGACTTC 
TCTCCAAGGGCAGGCTGTTAGGCCCCTTTCTGATCAGGAGGCTTCTTTATGAATTAAACTCG 



. CCCCACCACCCCCTCA 
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FIGURE 180 

MERVTLALLLLAGLTALEANDPFANKDDPFYYDWKNLQLSGLICGGLLAIAGIAAVLSGKCK 



YKS S QKQHS PVPEKAIPLI TPGS ATTC 
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FIGURE 181 



GGAGAAGAGGTTGTGTGGGACAAGCTGCTCCCGACAGAAGGS33TCGCTGCTGAGCCTGCCC 

TGGCTGGGCCTCAGACCGGTGGCAATGTCCCCATGGCTACTCCTGCTGCTGGTTGTGGGCTC 

CTGGCTACTCGCCCGCATCCTGGCTTGGACCTATGCCTTCTATAACAACTGCCGCCGGCTCC 

AGTGTTTCCCACAGCCCCCAAAACGGAACTGGTTTTGGGGTCACCTGGGCCTGATCACTCCT 

ACAGAGGAGGGCTTGAAGGACTCGACCCAGATGTCGGCCACCTATTCCCAGGGCTTTACGGT 

ATGGCTGGGTCCCATCATCCCCTTCATCGTTTTATGCCACCCTGACACCATCCGGTCTATCA 

CCAATGCCTCAGCTGCCATTGCACCCAAGGATAATCTCTTCATCAGGTTCCTGAAGCCCTGG 

CTGGGAGAAGGGATACTGCTGAGTGGCGGTGACAAGTGGAGCCGCCACCGTCGGATGCTGAC 

GCCCGCCTTCCATTTC^CATCCTGAAGTCCTATATAACGATCTTCAACAAGAGTGCAAACA 

TCATGCTTGACAAGTGGCAGCACCTGGCCTCAGAGGGCAGCAGTCGTCTGGACATGTTTGAG 

CACATCAGCCTCATGACCTTGGACAGTCTACAGAAATGCATCTTCAGCTTTGACAGCCATTG 

TC^GGAGAGGCCCAGTGAATATATTGCCACCATCTTGGAGCTCAGTGCCCTTGTAGAGAAAA 

GAAGCCAGCATATCCTCCAGCACATGGACTTTCTGTATTACCTCTCCCATGACGGGCGGCGC 

TTCCACAGGGCCTGCCGCCTGGTGCATGACTTCACAGACGCTGTCATCCGGGAGCGGCGTCG 

CACCCTCCCCACTCAGGGTATTGATGATTTTTTCAAAGACAAAGCCAAGTCCAAGACTTTGG 

ATTTCATTGATGTGCTTCTGCTGAGCAAGGATGAAGATGGGAAGGCATTGTCAGATGAGGAT 

ATAAGAGCAGAGGCTGACACCTTCATGTTTGGAGGCCATGACACCACGGCCAGTGGCCTCTC 

CTGGGTCCTGTACAACCTTGCGAGGCACCCAGAATACCAGGAGCGCTGCCGACAGGAGGTGC 

AAGAGCTTCTGAAGGACCGCGATCCTAAAGAGATTGAATGGGACGACCTGGCCCAGCTGCCC 

TTCCTGACCATGTGCGTGAAGGAGAGCCTGAGGTTACATCCCCCAGCTCCCTTCATCTCCCG 

ATGCTGCACCCAGGACATTGTTCTCCCAGATGGCCGAGTCATCCCCAAAGGCATTACCTGCC 

TCATCGATATTATAGGGGTCCATCACAACCCAACTGTGTGGCCGGATCCTGAGGTCTACGAC 

CCCTTCCGCTTTGACCCAGAGAACAGCAAGGGGAGGTCACCTCTGGCTTTTATTCCTTTCTC 

CGCAGGGCCCAGGAACTGCATCGGGCAGGCGTTCGCCATGGCGGAGATGAAAGTGGTCCTGG 

CGTTGATGCTGCTGCACTTCCGGTTCCTGCCAGACCACACTGAGCCCCGCAGGAAGCTGGAA 

TTGATCATGCGCGCCGAGGGCGGGCTTTGGCTGCGGGTGGAGCCCCTGAATGTAGGCTTGCA 

GTGACTTTCTGACCCATCCACCTGTTTTTTTGCAGATTGTCATGAA 
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FIGURE 182 

MSLLSLPWLGLRPVAMSPWLLLLLW^ LAWTYAFYNNCRRLQCFPQPPKRNWFWG 
HLGLITPTEEGLKDST<^ATYSQGFTVWLGPI I PF I VLCHPDT I RS I TN AS AAI APKDNLF 
IRFLKPWLGEGILLSGGDKWSRHRRMLTPAFHFN^ 

SRLDMFEHISLMTLDSLQKCIFSFDSHCQERPSEYIATILELSALVEKRSQHIW3HMDFLYY 
LSHIX5RRFHRACRLVHDFTDAVIRERRRTLPTQGIDDFFKDKAKSKTI^ 
KALSDEDIRAEADTFMFGGHDTTASGLSWVLYNLARHPEYQERCRQEVQELLKD 
DDLAQLPFLTMCVKESLRLHPPAPFI SRCCTQDI VLPDGRVI PKGITCLIDI IGVHHNPTVW 
PDPEVYDPFRFDPENS KGRS PLAF I PFSAGPRNCIGQAFAMAEMKWLALMLLHFRFL PDHT 
EPRRKLELIMRAEGGLWLRVEPLNVGLQ 
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FIGURE 183 

CAACAGAAGCCAAGAAGGAAGCCGTCTATCTTGTGGCGAT(^ISTATAAGCTGGCCTCCTGC 

TGTTTGCTTTTCACAGGATTOTAAATCCrCTCrTATCTCTTCCTCTCCT 

AATATCCTTTC^CTCrrCAGCACCTCATGAAGACGCGCGCTTAACTCCGGAGGAGCTAGAAA 

GAGCTTCCCTTCTACAGATATTGCCAGAGATQCTGOGTGCAGAAAGAGGGGATATTCTCAGG 

AAAGCAGACTCAAGTACCAACATTTTTAACCCAAGAGGAAATTTGAGAAAGTTTC^GGATTT 

CTCTGGACAAGATCCTAACATTTTACTGAGTCATCTTTTGGCCAGAATCTGGAAACCATACA 

AGAAACGTGAGACTCCTGATTGCTTCTGGAAATACTGTGTCIOAAGTGAAATAAGCATCTGT 

TAGTCAGCTCAGAAACACCCATCTTAGAATATGAAAAATAACACSATGCTTGATTTGAAAAC 

AGTGTGGAGAAAAACTAGGCAAACTACACCCTGTTCATTGTTACCTGGAAAATAAATCCTCT 

ATGTTTTGCACAAAAAAAAAAAAAAA 
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FIGURE 184 

MYKLASCCI^FT6FLNPLLSLPLLDSREISFQLSAPHEDARLTPEEI£RASLI^ILPENI^3A 
ERGDILRKADSSTNIFNPRGNLRKFQDFSGQDPNILLSHLLARIVnCPYIOa^TPDCFWKYCV 



FIGURE 185 

G^CATTTTTAGTTCCC^GGAATGTACATCAGCCCCACGGAAGCTAGGCCACCTCTGGOAT 
GGGGTTGCTGGTTTAAAACAAACGCCAGTCATCCTATATAAGGACCTGACAGCCACCAGGCA 

CCACCTCCGCCAGGAACTGCAGGCCCACCT^^ 

AGGGACCGTCTGCaiGCCTCCrGCTCCTCGGCATGCTCTGGCTGGACTTGGCCATGGCAGGCT 
CCAGCTTCCTGAGCCCTGAACACCAGAGAGTCCAGCAGAGAAAGGAGTCGAAGAAGCCACCA 
GCCAAGCTGCAGCCCCGAGCTCTAGCAGGCTGGCTCCGCCCGGAAGATGGAGGTCAAGCAGA 
AGGGGCAGAGGATGAACTGGAAGTCCGGTTCAACGCCCCCTTTGATGTTGGAATCAAGCTGT 
CAGGGGTTCAGTACCAGCAGCACAGCCAGGCCCTGGGGAAGTTTCTTCAGGACATCCrCTGG 
GAAGAGGCCAAAGAGGCCCCAGCCGAC^G^TCGCCCACAAGCCTTACTCACCTCrCTCT 
AAGTTTAGAAGCGCTCATCTGGCTTTTCGCTTGCTTCTGCAGCAACTCCCACGACTGTTGTA 
CAAGCTCAGGAGGCGAATAAATGTTCAAACTGTA 
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FIGURE 186 

MPSPGTVCSLLLliGMLWLDLAMAG^ 

GQAEGAEDELE VRFNAPFDVG I KLSGVQYQQHSQ ALGKFLQD I LWEEAKEAPADKO 
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FIGURE 187 

CGGCCACAGCTGGCATGCTCTGCCTGATCGCCATCCTGCTGTATGTCCTCGTCCAGTACCTC 

GTGAACCCCGGGGTGCTCCGCACGGACCCCAGATGTCAAGAATATGAACACGTGGCTGCTGT 

TCCTCCCCCTGTTCCCGGTGCAGGTGCAGACCCTGATAGTCGTGATCATCGGGATGCTCGTG 

CTCCTGCTGGACTTTCTTGGCTTGGTGCACCTGGGCCAGCTGCTCATCTTCCACATCTACCT 

GAGTATGTCCCCCACCCrAAGCCCCCGATCCCCCCAAGGCTGGGTGGTCAGAGCTGCTCATC 

TTACACCTCTACTTGAGTATGTCCCTAACCCTGAGCCCCCCACGCCTGGGGCCAGAGTCTTT 

GTCCCCCGTGTGCGCATGTGTTCAGGGTCAGCCTCTCCCAGAAGTGAGATCATGGACAAAAA 

GGGCAAATCACJVGGAAGAAATTAAATCCATGAGGACCCAGCAGGCCCAGCAAGAAGCTGAAC 

TCACGCCX5AGACCTGC^GGAGTGGTGCCAGGTGCT2GAAGTAACAAGTTTAAAATGTTCAGA 

GACAATGGAATGGAATCTATTAGGQ^GAACAGGACATTATGAAATAAGGACAGGTGGACTT 

CCAAAAACACAAGTAGAAATTCTAACAATGAAATATATTACAGGCAGGTCACCCACTAACCA 

AACAACTGAAGCGAGAGCTGTGGTCTTGCTTGGTCTCACAGTGGGCACAGCGGTAGGCGGTC 

AGTCATGTTGCTGAACGACGGAGGGTAAACTCCCCAGCCCCAAGAAAACCTGTGTTGGAAGT 

AAO^CAACCTCCCTGCTCCK3GCACCAGCCGTTTTGGTCATGGTGGGCCAGCTGC^^ 

TCTTCCATTCTCTGGGCAGTGGTGGCCCCGAGGCTGTGGCCTCTCAGGGGGTTTCTGTGGAC 

ACGGGCAGCAGAGTGTGTCCAGGCCAGCCCCCAAGAATGCCCTGCTCCTGACAGCTTGGCCA 

ACCCCTGGTCAGGGCAGAGGGAGTTGGGTGGGTCAGGCTCTGGGCTCACCTCCATCTCCAGA 

GCATCCCCTGCCTGCAGTTGTGGCAAGAACGCCCAGCTCAGAATGAACACACCCCACCAAGA 

GCCTCCTTGTTCATAACCACAGGTTACCCTACAAACCACTGTCCCCACACAACCCTGGGGAT 

GTTTTAAAACAGACACCTCTAACGCATATCTTACAGTCACTGTTGTCTTGCCTGAGGGTTGA 

ATTTTTTTTAATGAAAGTGCAATGAAAATCACTGGATTAAATCCTACGGACACAGAGCTGAA 

AAAAAAAAAAAAAAAAAAAAAAAAAAAAAA 
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FIGURE 188 

MNTWLLFLPLFPVQVQTLI WI IGMLVLLLDFLGLVHLGQLL IFHI YLSMSPTLSPRSPQGW 
VVRAAHLTPIiLEYVPNPEPPTPGARVFVPRVRMCSGSASPRSEIMDKKGKSQEEIKSMRTQQ 
AQQEAELTPRPAGWPGA 
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FIGURE 189 

(XaAGTGCAGATGGCATCCTTCGGTTCT 

ATGGAGCTCTCGAAGGCCTTCTCTGGCCAGCXKs^ 

ATCACTCAGCTTCTCCACAACATCCCrrGCTCAGCAACT 

TGCCCAAGCCCCTGTGCGAGAAAGGTCTGGCAGCCAAGTGCTTTGACATGCCAGTGTCCCTG 

GATGGAGATACCAACACATCCACCCAGGAGGTGGTACAATACAACTGGGAGACTGGGGATGA 

CCGGTTCTCCTTCCGGAGCTTCCGGAGTGGCATGTGGCTATCCTGTGAGGAAACTGTGGAAG 

AACCAGGGGAGAGGTGCCGAAGTTTCATTGAACTTACACCACCAGCCAAGAGAGGTGAGAAA 

GGACTACTGGAATTTGCCACGTTGCAAGGCCCATGT(^CCCCACT 

GCGGTTGATGGAGAAGGCXTCCCTCCCCrCCCCTCCCTTGGG^ 

TGGTTATCCCTGGGAACGCAGATCACCTACATCGGACTTCT^TTCATCAGCTTCCTCCTGCT 
ACTAACAGACTTGCTACTCACTXXMAACCCTGCCTGT 

CTGTTTCCTCIX3TCCTGTCAGGTCTCCTGGGGATGGTGGCCCACATGATGTATTCACAAGTC 
TTCCAAGCGACTGTCAACTTGGGTCCAGAAGACTGGAGACCACATGTTTGGAATTATGGCTG 
GGCCrTCTACATGGCCTGGCTCTCCTTCACCTGCTGCATGGCGTCGGCTGTCACCACCTTCA 
ACACGTACACCAGGATGGTGCTGGAGTTCAAGTG CAAG CATAGTAAGAGCTTCAAGG AAAAC 
CCGAACTGCCTACCACATCACCATCAGTGTTTCCCTCGGCGGCTGTCAAGTGCAGCCCCCAC 
CGTGGGTCCTTTGACCAGCTACCACCAGTATCATAATCAGCCCATCCACTCTGTCTCTGAGG 
GAGTCGACTTCT ACTC CGAG CTGCGGAACAAGGG ATTT CAAAGAGGGGCCAGCCAGGAGCTG 
AAAGAAGCAGTTAGGTCATCTGTAGAGGAAGAGCAGTGTTAGGAGTTAAGCGGGTTTGGGGA 

GTAGGCTTGAGCCCTACCTTACACGTCTGCTGATTATCA^ 

TCTCTTGAGCATGGTTTTTAGAGGCTACGAATAAGGCTATGAATAAGGGTTATCTTTAAGTC 
CTAAGGGATTCCTGGGTGCCACTGCTCTCTTTTCCTCTACAGCrrCCATCTTGTTTCAC 
CCCACATCTCACACATCCAGAATTCCCTTCTTTACTGATAGTTTCTGTGCCAGGTTCTGGGC 
TAAACCATGGAGATAAAAAGAAGAGTAAAATACACTTCCCGACCTTAAGGATCTGAAA 



FIGURE 190 

MAKMELSKAFSGQRTLLSAILSMLSLSFSTTSLLSNYWFVGTQKW 
VSLIX5DTNTSTQEWQYNWETGra 

GEKGLLEFATLQGPCHPTLRFGGKRLMEKASLPS PPLGLCGKNPMVI PGNADHLHRTS IHQL 

PPATNRLATHVTCPCLWAQTERLCCCFL^ 

LWLGLLHGLALLHLLHGV^ 
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FIGURE 191 

AACTGGAAGGAAAGAAAGAAAGGTCAGCTTTGGCCCAG&3E9TGGTTACCCCTTGGTCTCCTG 
TCTTTATGTCTTTCTCCTCTTCCTATTCTGTCATCTCCCTCACTTAAGTCTCi^GGCCTGTCA 
GCAGCTCCTGTGGACATTGCCATCCCCTCTGGTAGCCT 

TTATGGATGTTTCCACCAACCAGGGTAGTGGCATGGAGCACCGTAACCATCTGTGCTTCTGT 
GATCTCTATGACAGAGCCACTTCrCCACCTCTGAAATGTTCCCTGCTCTGAAATCTGGCATG 
AGATGGCACAGGTGACCACGCAGAAGCCACCAGAATCTTGCCTGCCCTATTCCTCCTCCCAA 
GTCTGTTCTCTTATTGTCAACCTCAGCA(^CAGGCTGGCGCCAATGGCATTACAGAGAAAG 
CAATCTGTGTGGCTAGTGGGCAGATTACCATGCAAGCCCCAGGAGAAATGGAGGAGCTTTGT 
AGCCACCTCCCTGTCAGCCAGTATTAACATGTCCCCTTCCCCCTGCCCCGCCXTrAGATTCAG 
GACATTCG CCCCTGTGTGCCAC CAAACCAGGACTTTC CCCTTGGCTTGGCATCCCTGGCT CT 
CTCCTGGTACCGAGCAAGACGTCTGTTCCAGGGCAGTGTAGGATCTTTCAAGCTCCGTTACT 
ATGGCGATGGCCATGATGTTACAATCCCACTTGCCTGAATAATCAAGTGGGAAGGGGAAGCA 
GAGGGAAATGGGGCCATGTGAATGGAGCTGCTCTGTTCTCCCTACCCTGAGGAAAAACCAAA 
GGGAAGCAAC^GGAACTTCTGCAACTGGTTTTTATCGGAAAGATCATCCTGCCTGCAGATGC 
TGTTGAAGGGGCACAAGAAATGTAGCTGGAGAAGATTGATGAAAGTGCAGGTGTGTAAGGAA 
ATAGAACAGTCTGCTGGGAGTCAGACCTGGAATTCTGATTCCAAACTCTTTATTACTTTGGG 
AAGTCACTCAGCCTCCCCGTAGCCATCTCCAGGGTGACGGAACCCAGTGTATTACCTGCTGG 
AACCAAGGAAACTAACAATGTAGGTTACTAGTGAATACCCCAATGGTTTCTCCAATTATGCC 
CATGCCACCAAAACAATAAAACAAAATTCTCTAACACTGAAA 
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FIGURE 192 

MWLPLGLLSLCLSPLPILSSPSLKSQACQQLLWTLPSPLVAFRANRTTYVMDVSTNQGSGME 
HRNHLCFCDLYDRATSPPLKCSLL 
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FIGURE 193 

GTAGCGCGTCTTGGGTCTCCCGGCTGCCGCTGCTGCCGCCGCCGCCTCGGGTCGTGGAGCCA 
GGAGCGACGTCACaSCC^TG^CAGGCATCAAAGCrTTGATTAGTTT 

ATCGGACTGATGTTTTTGATGCTTGGATGTGCCCTTCCAATATACAACyU^TACrGGCCC^ 
CTTTGTTCTATTTTTTTACATCCrTTCACCT 

ATGATACAGATGCTATGAGTAACGCTTGTAAGGAACTTGCCATCTTTCTTACAACGGGCATT 
GTCGTGTCAGCTTTTGGACTCCCTATTGTATTTGCCAGAGCACATCTGATTGAGTGGGGAGC 

TTGTGCACTTGTTCTCACAGGAAACACAGTCATCTTTGCA^ 

TCTTTGGAAGCAATGACGACTTCAGCTGGCAGCAGTGG2Syy^AAGAAATTACT 

TCAAATGGACTTCCTGTC^TTTGTTGGCCATTCAC^ 

TGAATGGTATAGCAAGCCTCTTGGGGGTATTTTAGGTGCTCCCTTC TCACTT TTATTO 

CATACTATTTTCACAGAGACTTGCTGAAGGATTAAAAGGATTTTCTCrrrrGGAAAAGCTTG 

ACTGATTTCACACTTATCTATAGTATGCTTTTTGTGGTGTCCTGCTGAATTTAAATATTTAT 

TGTAATC^TTTGCATTGGTTAGGAATTCAGAATTCCGCCGGCTCTATTACTGGTCAAGTACA 

TCTTTTCTCTTAAAATTATTTAGCCTCCATTATTACAAAAAATTATAAAAATAAGTTTTCyWS 

TCAGTCAGGATGACATCACTCCCAATGTTATGCAGACATACAGACGGTTGGCATACGTTATA 

GACTGTATACTCAGTGCAAATATAGCTGCATTTATACCTCAGAGGGGCCAAGTGTTAATGCC 

CATGCCCTCCGTTAAGGGTTGTTGGTTTTACTGGTAGACAGATGTTTTGTGGATTGAAAATT 

ATTTTATGGAATTGCTACAGAGGAGTGCITTTCTTCTCAATTGTTAGAAGAATTTATO 

ACTTTAAGGTAAGGGTGTAAAAACATTTTTGAGATAAGGTTTTTATTTATGTTTATTATTGT 

TAGAGTGAGTTGCAATGTGGGAAGAAATGACATTGAAATTCCAGTTTTTGAATCCTGTTTCT 

ATTTATAAGTGAAATTTGTGATCTCCTATCAACCTTTCATGTTTTACCCTGTTAAAATGGAC 

ATACATGGAACCACTACTGATGAGGGAC^GTTGTATGTTTGCATCATATATGCCAGAAAACC 

TTCCTCTGCTTCCTCCTTTTGACTTATTTGGTATGTTGTATATATTACATAAAATAACTTTT 

CAAATATAGTTTAATAACACTTAGAAGTGTTTACTTACCTGGAAAATAATTGCTATGCCGTA 

CATTCAGAGTGCCCCCTCCCCTGCAAGGCCTTGCCATGATTAACAAGTAACTTGTTAGTCTT 

ACAGATAATTCATGCATTAACAGTTTAAGATTTAGACCATGGTAATAGTAGTTCTTATTCTC 

TAAGGTTATATCATATGTAATTTAAAAGTATTTTTAAGACAAGTTTCCTGTATACCTCTGAA 

CTGTTTTGATTTTGAGTTCATCATGATAGATCTGCTGTTTCCTTATAAAAGGCATTTGTTGT 

GTGAGTTAATGCAAAGTAGCCAAGTCCAGCTATATAGCAGCTTCAGAAACATACCTGACCAA 

AAAATTCCCAGTAACCAGGCATGATCAATTTATAGTGGTCGTTTACATCT AATA ATTATCAG 

GACTTTTTTCAGGAGTGGGTTATAAAAACATTCAAGTTGGTCTGACAGTATTTTGTTAAGGA 

TATTTGTTTGTATGTTTATTCAGTATACrTACATAAAAATTATTTCGCCATCAGCC^AAACT 

CAGTAATCATGACAGCTGTCTGTTGTTTTATGAAGTTTATTTCTCAAGAAAATGGGAATAAA 

TTTGGGATTTGTTCAGCTTTTTTACTAAAGATGCCTAAAGCCACAGGTTTTATTGCCTAACT 

TAAGCCATGACTTTTAGATATGAGATGACGGGAAGCAGGACGAAATATCGGCGTGTGGCTGG 

AGCCTTCCCACTGGAGGCTGAAAGTGGCTTGTGGTATTATAATGTTCAGATTTCAAGAGGAA 

GGTGCAGGTACACATGAGTTAGAGAGCTGGTGAGACAGTTGGGAACTCTTTGTGCTTGTGAT 

CrACTGGAC TlTlTrT TTGCAGGAAGTGCATTCTCTGGTCCTTCCCTATTTTCTGTTCTGGA 

TGTCAGTGCAGTGCACTGCTACTGTTTTATCCACTTGGCCACAGACTTTTTCTAACAGCTGC 

GTATTATTTCTATATACTAATTGCATTGGCAGCATTGTGTCTTTGACCTTGTATACTAGCTT 

GACATAGTGCTGTCTCTGATTTCTAGGCTAGTTACrTGAGATATGAATTTTCCATAGAATAT 

GCACTGATACAACATTACCATTCTTCTATGGAAAGAAAACTTTTGATGATGAAAC^TAAAG 

ATTTTAAATATCTATTTTAAAAAAAAAA 
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FIGURE 194 

MAGIK^ISLSPGGAIGLMFLMIiGCALPIYNKYWPLPVLFFYILSPIPYCIARRLVDDTDAM 
SNACKELAIFLTTGIVVSAFGLPIVFARAHLIEWGACALVLTGNTVIFATILGFFLVFGSND 

DFSWQQW 
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FIGURE 195A 

CCC^CGCGTCCGCCC^CGCGTC^ 

CACGCGTCCGCCCACGCGTCCGGTGCAAGCTCGCGCCGCACACTGCCTGGTGGAGGGAAGGA 
GCCCGGGCGCCTCTCGCCGCTCCCCGTOCCGCCGTCCGCACCTCCCCACCGCCCGCCGCCCG 
CCGCCCGCCGCCCGCAAAGCATGAGTGAGCCCGCTCTCTGCAGCTGCCCGGGGCGCGAATGG 
CAGGCTGTTTCCGCGGAGTAAAAGGTGGCGCCGGTCAGTGGTTO 

AACCAGACTGTCAGATCCTGGGGAGTCGCGAGCCCCGAGTTTGGAGTTTTTTCCCCCCAC^ 

CGTCACAGTCCGAACTGCAGAGGGAAAGGAAGGCGGC^GGAAGGCGAAGCTCGGGCTCCGGC 

ACGTAGTTGGGAAACTTGCX3GGTCCTAGAAGTCGCCTCCCCGCCTTGCCGGCCGCCCTTG 

GCCCCGAGCCGAGCAGCAAAGTGAGACATTGTGCGCCTGCCAGATCCGCCGGCCGCGGACCG 

GGGCTCCCTCGGAAACACAGAGGGGTCTTCTCT 

AGGGAGCAGCTGAATGGAGGTTGTCACTCTCTGGAAAAGGATTTCTGACCG^ 

TGGACATTCTCC^GTCTCTCTGGAAAGATTCTCGCTAAIGGATTTCCT 

GTCTATACTGGCTGCTGAGGAGGCCCTCGGGGGTGGTCTTGTGTCTGCTGGGGGCCTGCTTT 

CAGATGCTGCCCGCCGCCCCCAGCGGGT^CCGCAGCTGTGCCGGTGCGAGGGGCGGCTGCT 

GTACTGCGAGGCGCTCAACCTCACCGAGGCGCCCCACAAC^ 

CCCTGCGCTACAACAGCCTCTCGGAGCTKXX3CGCCGGC 

ACGTGGCTCTATCTGGATCACAATCAC^ 

GCGCCGAGTTAAGGAACTCACGCTGAGTTCCAACCAGATCACCCAACTGCCCi^ 

TCCXK3CCCATGCCCAACCTGCGCAGCGTGGACCTCTCGTACAACAAGCTGCA 

CCCGACCTCTTCGACGGGCTGCGK3AAGCTCAC 

GTTTGTGCCCGTGCGCATCTTCCAGGACTGCCGCAGCCTCAAGTTTCTCGACATCGGATACA 

ATCAGCTCAAGAGTCTGGCGCGCAACTCTTTCGCCGGCTTGTTTAAGCTG!ACCGAGCTGCAC 

CTCGAGCACAACGACTTGGTCAAGGTGAACTTCGCCCACTTCCCGCGCCTCATCTCCCTG^ 

CTCGCTCTGCCTGCGGAGGAACAAGGTGGCCATTGTGKj^ 

ACCTGGAGAAAATGGACTTGTCGGGC^CGAGAT^ 

ACCGTGCCGCACCTGCAGTCCCTGCAGCTGGACTCCAACCGCCTCACCT 
GATCCTCAACTCTTGGAAGTCCCTGACAAGCATCAC^ 

GGCGCAACGTGTGTGCCCTAGCCTCGTGGCTCAGGAACTTCCAGGGGCGCTACG^ 

TTGCAGTGCGCCAGCCCGGAGTACGCACAGGGCGAGGACGTCCTGGACGCCGTGTACGCCTT 

CCACCTGTGCGAGGATGGGGCCGAGCCCACCAGCGGCCACCTGCTCTCGGCCGTCACCAACC 

GCAGTGATCTGGGGCCCCCTGCCAGCTCGGCCACCACGCTCGCGGACGGCGGGGAGGGGCAG 

CACGACGGCACATTCGAGCCTGCCACCGTGGCTCTTCCAGGCGGCGAGCACGCCGAGAACGC 

CGTGCAGATCCACAAGGTGGTCACGGGCAC 

TCCTGGTGCTCTACGTGTCCTGGAAGTGTTTCCCAGCCAGCCTCAGGCAGCTCAGACAGTGC 

TTTGTCACGCAGCGCAGGAAGCAAAAGCAGAAACAGACCATG 

TGCCCAGGAATACTACGTTGATTACAAACCGAACCACATTGAGGGAGCCCTGGTG 

ACGAGTATGGCTCGTGTACCTGCCACCAGCAGCCCGCGAGGGAATGCGAGGTGTgStTTGTCC 

CAGTGGCTCTCAACCCATGCGCTACCAAATACGCCTGTO 

CCAGGCTGGGGTCTCCTTGTCTGTGCTCTGATATGCTCCTTGACTGAAACTTTAAGGGGATC 
TCTCCCAGAGACTTGACATTTTAGCT^ 

AACAAAAAACCCCACCCC^CAACCTTCAGGACAGTCTATCTTAAATTTCA 

TTCCTCCCTTTGAAGATCTGTCCATATTCAGGAATCTGAGAGTGTAAAAAAGGTGGCCATAA 

GACAGAGAGAGAATAATCGTGCTTTGTTTTATGCTACTCCTCCCACCCTGCCCATGATTAAA 

CATCATGTATGTAGAAGATCTTAAGTCCATACGC^TTTCATGAAGAACCATTGGAAAGAGGA 

ATCTGCAATCTGGGAGCTTAAGAGCAAATGATGACCATAGAAAGCTATGT^ 

TGTGTGTCTGTATGTTTCTGCGTTGTGTGTCT^^ 

CGGGAATTTAGCTCACATCATTTCATGCCCCTC 

AGGTGGGGGGAAACGGCAGGAATAAGGGAAAGTGGTAGTTT^ 

TGAAATCTTTTCTTTCTCAAATTAATTATCTTTAAGCTTCAAGAAACTO 

TAAGCAAACTACTAAGCATTTAAAAGAGAATCTAATTTT^ 

TATTCTTCCCACAGAGGGTGCTAATCTCATTATGCTGTGCTATCTGAAAAGAACTTAAGGCC 

ACAATTCACGTCTCGTCCTGGGCATTGTGATGGA^ 

GCTGATTAAAGTTCAGCAGTGGTATTGAGGTTTTTC 

TTCACATGACAAATGACACTCTCACACCAGTCT^ 

GAGGAAGCAGGTTAAATGAGACCTGTCCTCTGCTGCACT 

TGCTCAGATCTTAGCCTTGATATTAATAGTTGAGACCACCTACCCACAATGCAGCCT 
CCCAAGACTACAT^GTTACCATCGCAAAGGAAAGGTTATTCCAGTAAAAGGAAATAGTTTTC 
TCAACCATTTAAAAATATTCITCTGAACTCAT 
CTGCCTTCAAGAAGGCAGACATTTGGTATGATTTAGCATCAAG^ 
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FIGURE 195B 

GTAAGTAATCAGAGGGGCAAATGCCACTTGTTATTC 

ACAGATCTCTGGTAGGATTAGGGGCCACTC^ 

CAAGTTTGATGCCTAGTCTATCTGACATGGCCC^ 

AGATGGTAGAAGGAACATCATCACATA<XICCTCTCACAGAGAAAATTATQ^ 

ATTATATCTGTTTTGGAGCAAGAGTGTCATAATGTTTCAGGGTAGTCAAAATAA^ 

TATCTCCTCTAGATGAGTGGCGATGTTGGCTGATTTGGGTCTGCCATTGACAGAATC 

TAAAAAGGAATTAGCTAGAATATGACCATTAAATGTC 

TTTAGAATGTCA 



FIGURE 196 

MDFLLLGLCLYWLLRRPSGVVLCLLGACFQMLPAAPSGCPQLCKCEGRLLYCEALNLTEAPH 
NLSGLLGLSI^YNSLSELRAGQPTGLMQLTWLYLDHNHICSVQGDAFQKLRRVI^TLSSNQ 
ITQLPNTTFRPMPNLRSVDLSYNKLQALAPDLFHGLRKLTTLHMRANAIQFVPVRIF 
LKFLDIGYNQLKSLARNSFAGLFKLTELHLEHNDLVK^ 

VS SLDWVWNLEKMDLSGNEI E YME PHVFETVPHLQS LQLDSNRLTY I EPRI LNSWKSLTS IT 

LAGOTjWIX^GRNVCALASWLSNFQGRYDGNLQCASPEYAQGEDVLDAVYAFHLCT 

HLLSAVTNRSDLGPPASSATTLADGGEGQHDGTFEPATVALPGG 

LIFSFLIVVLVLYVSWKCFPASLRQI^QCFVTQRRKQKQKQTMHQMAAMSAQEYYVDYKPNH 
IEGALVIINEYGSCTCHQQPARECEV 
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FIGURE 197 

GTGC^GGAGCCGAGGCGAGATGGGCGTC 

GCACTGACCCAGGCGGTCTCCAAACTCT 

CTGGAGCCAGAACO^CCCCXSTGaSCCGGC^ 

TGTCAGTCCTGGTGCAAGAAGGTCACGCCGTCTCAGACATGOT 

CTCGTCCTGGCTTCAGGAGCCXK^TTCXSGCGTCT 

CGCX5GGCGAACCTGCCGTCTTCCGCGACTCTGACCGCT 

GGCGCTCTGGGGACGAGGCACCTGGCCTCTTC^ 

CACGACGACXjTCTTCTTT C CGCCTAGTGCXZTCCTTCCGCGTGGGGCT 

CCCCGTGCGTGTCCGCAG^TCTCGGCTCTGGGCCGGAC^ 

CTGTTTTCCTGGOTTCCCGCGCGGGCCGCCTACGCT^ 

GGCCCCGAGGACTGCX3CGGACCCGTCGGGCTGCGTCTGCGGCAACGC 

GATCTGOTGGGCX!CIX3CTCCAGCCXXrr 
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FIGURE 198 



MGVLGRVLLWLQLQ^TQAVSKLWVPNTDFDVAANWSQNRTPCAGGAVEFPM)KMVSVLVQE 
GHAVSDMLLPLDGELVLASGAGFGVSDVGSHLDCGAGEPAVFRDSDRFSWHDPHLWRSGDEA 
PGLFFVDAERVPOIHDDVFFPPSASFRVGLGPGASPVRVRSISALGRTFTRDEDLAVFLASR 

AGRLRFHGPOALSVGPEDC^PSGCVCGNAEAQPWICAALLQP 
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FIGURE 199 

AT<^CATCAATTGGGAGTACCATCTTCCT(^TOGGACCAGTGAAACAGCTGAAGC^ 

TGAGCCTACTCGTTTGATTGCAACTATCATGGTGCrGTTC 

CTGCCTTTTGGTGGCATAACAAGGGACT1X3CACTTATCTTCTGCA 

TTGACGTGGTACAGCCTTTCCTTC&TACCATTTGCAAGGG 

CGTGTGTCTTGCASAAXTCATGGCCAGTTTTATC 

CTGGTGGACAGTTTTGTAACTATCTTCGAAACCTCTGTCTTA^ 

TGCAGC^TGTGTTGCTTGTGATTCGAAC^TTTGAGGGTTACTTTTGGAAG 

TCTCGAACCT<1AATGTCAGTAGCACAGGATGAGAAGTGGGTTCTGTATCTTGTGGAG 
TCTTCCTGATGTACCTGTTTCCTCTCK^ATC 

CTATTCAGCAACAGCAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAA 
AAAAAAAAAAAAAAAAAAAAAAAA 
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FIGURE 200 

MGPVKQLKRMFEPTIUjIATIMVLLCFALTLCSAFWWHNKGLALIFCILQSLALTWYS 



FARDAVKKCFAVCLA 
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FIGURE 201 

TTGAGCGCA<^TGAGCTCCTGCG03TTCCGGGGGCGTTCCTCC^GTCA<:CCrCCCGCCGTTA 
CCCGCGGCGCGCCCGAGGC^GTCTCCTC<^GACCCTCCCTCCCGTTGCTCCAAACTAATACG 
GACTGAACGGATC<3CTGCGAGGGTGGGA<^GAAAATTAGGGGGAGAAA<5GACAGAGAGAGCA 
ACTACCATCCATAGCC^GATAGATTATCTTACACTGAACTGATCAAGTACTTTGAAAATGAC 
TTCGAAATTTATCTTGGTGTCCTTCATACTTGCrGCA 

TCCAACTAGACCAGCAAAAGGTTCTACTAGTTTCTTTTGATGGATTCKX3TTGGGATTAOT 

TATAAAGTTCCAACGCCCCATTTTCATTATATTATGAAATATGGTGTTCACGTGAAGCAAGT 

TACTAATGTTTTTATTACAAAAACCTACCCTAACCATTATACTTTGGTAACTGGCCTCTTTG 

cagagaatcatgggattgttgcaaatgatatgtttgat<x:tatt<x^ 

TTGGATCACATGAATATTTATGATTCCAAGTTTTG<3GAAGAAGCX^<^CCAATATGGATCAC 

aaaccagagggcaggacatactagtggtgcagccatgtggcccggaacagatgtaaaaatac 

ATAAGCGCTTTCCTACTCATTACATGCCTTACAATGAGTCAGTTTCATTTGAAGATAGAGTT 

GCCAAAATTGTTGAATGGTTTA<X3TCAAAAGAGCCCATAAATCTTGGTCTTCT 

AGACCCTGATGACATGGGCCACCATTTGGGM 

CAGATATTGACAAGAAGTTAGGATATCTCATACAAATGCTCAAAAAGGCAAAGTTGTGGAAC 

ACTCTGAACCTAATCATCACAAGTGATCATGGAATGACGCAGTGCTCTGAGGAAAGGTTAAT 

AGAACTTGACCAGTACCTGGATAAAGACCACTATACCCTGATTGATCAATCTCCAGTAGCAG 

CCATCTTGCCAAAAGAAGGTAAATTTGATGAAGTCTATGAAGCACTAACTCACGCTCATCCT 

AATCTTACTGTTTACAAAAAAGAAGACGTT CCAGAAAGGTGGCATTACAAATACAACAGTCG 

^TTCAACCAATCATAGCAGTCGCTGATGAAGGGTG 

ACTTTCTGTTAGGCAACCACGGTTACGATAATGCGTTAG 

GCCCATGGTCCTGCCTTCAGAAAGAATTTCTCAAAAGAAGCCATGAACTCCACAGATTTGTA 
CCCACTACTATGCCACCTCCTCAATATCACTGC^ 

TCCAGGATCTGCTCAATTCAGCAATGCCAAGGGTGGTCCCTTATACACAGAGTACTATACTC 
CTCCCTSGTA^TOTTAAACC^^ 

GGTCTCTCTTGGCAG<^TTATAGTGATTGTATTTTTTGTAATTTTCATTAA^TTTAATTC 
ACAGTCAAATACCTGCCTTACAAGATATGCATGCTGAAATAGCTCAACCATTATTACAAGCC 
TAATGTTACTTTGAAGTGGATTTGCATATTGAAGTGGAGATTCCATAATTATGTCAGTGT 

A^GGTTTCAAATTCTGGGAAAC<»GTTCCAAA<^TCTGCAGAA^ 
ATTTAGGTATACACACACAC^CA<*CACACA<^T^ 
CTWy^GGAATAAAGATGTGAGAGTATGTCTCCATTGTTCACTGTAC 
AGATCCTGCTTTATTTGGACTTGGCX3CAGATAATGTATATATTTAGCAACTTTGCACTATGT 

AAAGTACCTTATATATTGCACTTTAAATTTCTCT 
CTTTATGGACAGTTATGTCTTATAACITGATTGA 
AGAATACTTGTTACGCATTGTTCAAACTGAAGGAAATTTCTA^ 

TAGAAATCTATCTCCATAAATTGA^AGAAGAAGAAGGTGATAAGTGTTGAAAATTAAATGTG 

ATAACCTTTGAACCTTGAATTTTGGAGATGTATTCCCAA^^ 
TTCTTGTCTTATTTCTTTCCAGAGAAOGTGGTTTTCATTTATTTTO 
AAATACTGACAGATTCGTTCTAAATATATTGTTTCTGTCATAAAATTATTGTGA 
TGAGTCATATTACTGTGATTTT<^TAATAATCAAGACACCATGAATATACl"lTi'CiTCTATA 

TAGTTCAGOATGGCCTGAATAGAAGCAACCAGGCACCATCTCAGCAATGTTTTCTC 

TGTAATTATTTGCTCCTTTGAAAATTA^ 

AAAAAAAAAAAAAAAAAAA 
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FIGURE 202 

NrrSKFILVSFILAALSLSTTFSLQLDQQKVLLVSFDGFRWD 
QVTNVFITKTYPNHYTLVTGLFAENHGIV^ 

ITNQRAGHTSGAAMWGTDVKIHKRFPTHYMPYNESVSFEDRVAKIVEWFTSKEP 
WEDPDDMGHHLGPDSPLMGPVI SDIDKKLGYLIQMLKKAKLWNTLNLI ITSDHGMTQCSEER 
LIELDQYLDKDHYTLIDQSPVAAILPra 

SRIQPIIAVADEGWHILQNKSDDFLLGNHGYDNALADMHPIFLAHGPA 
LYPLLCHLLNITAMPHNGSFWNVQDLLNSAMPRVVPYTQSTILLPGSVKPM 
IGVSLGS 1 1 VI VFFVI FI KHLIHSQI PALQDMHAE I AQPLIfiA 



FIGURE 203 

GGATTTTTGTGATCOSCGATTC^ 

(^CAGGCCXZACCCTGC<3GGGCGGGAGGCAGCCGGGGTGAGGGAGGTGAAGA 
AGAGAGGCCAAGCCCCTTGCCTTGGGTCAC^CAGCGA^ 
CCAGATCCAGAGGCAACAGGGACA^IGGCCACCTGGGACGAAAAGGC^ 
^GGTGGCTCCCGCTGAGAGGATGAGCAAGTTCTTAAGGCACCT 

CTACCATGCCTGGAACATC^CTACAAGAAATGGGAGAATGAAGAGGAGGAGGAGGAGGAGG 

AGCAGCCACCACCCACACCAGTCTCAGGCGAGGAAGGCAGAGCTGCAGCCC^ 
CCTGCCCCTGGCCCCGCACCCAGGGCCCCCCTTGACTTCAGGGGC^TGTTGAGGAAACTGTT 

CAGCTCCCACAGGTTTCAGGTCATCATCAOT^ 

TTGCTGAGCTCATCCTGGACCTGAAGATCATCCAGCC<^CAAGAATAACTATGCTGCCATG 

GTATTCCACTAGATGAGCATCACCATCTTGGTCTTTTTTA 

ATT TGTCTTCCGCCTGAGTTCTTTCACCAC^ 

TGGTCTCATTCATCCTGGAC^TTGTCCTCCTGTTCCAGGAGCACCAGOT 
CTGCTGATTCTGCTCOSGCTGTGGCGGGTGGCCCGGATCATC^ 
XAAGACACGTTGAGAACGGGAACTCTTAAGGTTAAAACAGATGAATGTAC^ 
AGATTCAACACCTTGAGTTCAGCTGCTCTGAGAAGCCCCTGGACT^TGAGTTTGCTGTATC 

--AAG&TGTAAGGAGAA<XTCTCT.CCGGATGS.CT^ 

CACACAGCCACCGTGAAAGTCCTGGAGTAAAATGTGCTGTGTACAGAAGAGAGAGAAGGAAG 

CAGGCTGGCATGTTCACroGGCTGGTGTTAOGACAGAGAA^ 

TCACTTCAGATTACAAATCACACAGAGCATCTGCCTGTTTTCAATCACAAGAGAACAAAACC 

AAAATCTATAAAGATATTCTGAAAATATGACAGAATTT<3ACAAATAAAAGCATAAA<^ 

AAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAA 
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FIGURE 204 

MATWDEKAVTRRAKYAPAERMSKFLRHFTW 

SC3EEGRAAAPDVAPAPGPAPRAPLDPRGMLRKLFSSHRFQVI 1 1 CL WLD ALLVLAEL I LDL 
KI I QPDKNNYAAMVFHYMS I TI LVFPMME 1 1 FKLFVFRLSS FTTSLRS WMPVWWSF ILDI 
VLLFQEHQFEALGLLILLRLWRVARIINGI^^ 
CSEKPLD 
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FIGURE 205 

SS^SSaaaog^ 

S^TGGAAAACA^^ 

GGTGAGAATCTAAOGGTCCCGGATGAGTTCACAGTG^ 



gagacaccWggc*^^^ 

ACAGGC1 
GGCTTC2 



ACAGGCTCAAC^TCCTCkTGCAGGTC 
GGC^ScTCAGATGATTCGAGA^ 
^CAAAATT^ 



^SctaSccctggccctagtcaggacccg^ 
S^SctScca^g^ 

iCGAGAA 
5TGGACT 
1AACACT 
iAGAGCT 



TCTACGAGAACCTGAAGATCACCCTGGGCGTG^^ 
LCTCGCTGATCC 
lCTAAGCTGTCTC 
3CTGGCAGGCCC; 



rrAGTGGAOTCGCTGATCCTGGGCCGCAGCCTGGGGTGTGCAGCCATCTC^TTCTCTGAATG 

^Saacaotaag^tctcg^ccaagctgtgaaaaccct 

fiOGGAGAGCT<^Q 
A^SC^A^GACCACAGG^^ 



rr^^CCTGACTTCC^ 

gatctg«£™ttc^^ 
^JSagtSggg^tc^ 

??^??cSagS^cctcgaccctgtcaggatgggccccacctcac 

Jtc^TCGQGAC^^ 

Sa«5ggg<2gc^^ 



ar^T^-imCAAGTTCATTTTTTATTCATATTTATGTTCATG^ 
CTCTCCAGAGAAATTCCTTTIG^ 

otcottcct^ca^ 

^G^GTCGAGAGAGAGGGAGGAACCTCAATAACCTTGA^ 

SSotctgctcc^ttccttaacaatgc^^ 
at?coSg^ 

SSaStatcaacatataactggagtcgtcaaaaago^ 
a^to?Saotaaag^ 

AAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAA 
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FIGURE 206 

MLCLCLYVPVIGEAQTEFQYFESKGLPAELKS IFKLSVFI PSQEFSTYRQWKQKIVQAGDKD 
LIX^LDFEEFVHYLQDHEKKLRLVFKILDKKN^ 

SMDKNGTMTIDWNEWRDYHLLHPVEN I PE 1 1 LYWKHST I FDVGENL WPDEFTVEERQTGMW 
WRHLVAGGGAGAVSRTCTAPLDRLKVLMQVHASRSNNM^ 

NVLKIAPESAIKFMAYEQIKRLVGSDQETLRIHERLVAGSLAGAIAQSSIYPMEVLKTRMAL 
RKTGQYSGMLDCARRILAREGVAAFYKGYVPNMLGI I PY AG I DLAVYETLKNAWLQHYAVNS 
ADPGVFVLLACGTMS S TCGQLAS YPLAL VRTRMQAQAS I EGAPEVTMS SLFKH I LRTEGAFG 
LYRGLAPNFMKVI PAVS i S YWYENLKITLGVQSR 
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FIGURE 207 

GGAM^CAGCGGCAGCTCCACT 

^CTTCCCTGGGGCAGATCCTCTTCTGGAGCATAATO 
CAATTGCACTCATCATTGGCTTTGGTATTTCA^ 
GCCTCAGCTGGGAACATTGGGGAGGATGGAATCCTGAGCT 
ACTTTCTGATATCGTGATACAAT^ 

AAGAAGGCAAAGATGAGCTGTCGGAGCAGGATGAAATGTT 

GCTGATCAAGTGATAGTTGGCAATGCCT 

TGGCACCTACAAATGTTATATCATCACT 

AAACTGGAGCCTTCAGCATGCCGGAAGTGAATGT<X5ACTATAATGCCAGCT 

cggtgtgaggctccccgatggttccc<^^ 

GGGAGCCAACTTCTCGGAAGTCTCCAATACCAGCTTTGAG 

TGAAGGTTGTGTCTOTGCTCTACAATCTTA 

AATGACATTGCCAAAGCAACAGGGGATATCAAAGTGACAGAATC 

TCACCTACAGCTGCTAAACTCAAAGGCT^ 

GGGCACTTCTGCCTCTCAGCCCTTACCTGA 

CATGCAAAGTCATTGTTACAACAGGGATCTACACAACT 

TTTTATATTTCTGGGAGGAAATGAATT*^^ 

AGAAACAAAAAGAAGCCAAAAGCAGAAGGCTCCA^^ 

GACATATTAGAAGTTGGGAAAATAATTCATGTGAACTAGACAAGTGTGTTAAGAGTG^ 

TAAAATGC^CGTGGAGACAAGTGCATCCCX^ 

GGGGAGTGAGAGGACAGGATAGTGCATGTTCTTTGTCTCTGA^ 

TAATGTTGCTCTGAGGAAGCCCCTGGAT^GTCTATC 

CAAATTAAGCTGTAGTATCTACCCTAAGAOGCTGCT^ 

GGCGGCTGCATTTTAGTAATGGGTCAAATGATTCACTTTT^ 

TGGCTTCTCTTCCCAACTGACAAATGCCAAAGTTGAGAAA 

ACAGAGCAGTCGGGGACACJCGATTTTATAAA 

AAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAA 



WO 99/63088 PCT/US99/12252

FIGURE 208

MASLGQILFWSIISIIIILAGAIALIIGFGISGRHSITVTTVASAGNIGEDGILSCTFEPDI 
KLSDIVIQWLKEGVLGLVHEFKEGKDELSEQDEMFRGRTAWADQVIVGNASLRLKNVQLTD 
AGTYKCYI I TSKGKGNANLE YKTGAFSMPEVNVDYNAS SETLRCEAPRWF PQPTWWASQVD 
Q^3AOTSEVSNTSFELNSENVTMKWSVLYNVTI 
SHLQLLNSKASLCVSSFFAISWALLPLSPYLMliK 
FIGURE 209

GAATTTGTAGAAGACAGCGGCGTTGCCi^ 

GCTGGTGGCCGCTCTGTGGGGTGGCACGCAGra 

AGCGGGTTCATGAGCCX^CCTGGGCCCAGCAGTTGCTACAG^ 

AATACTGAGTACCTGATGCCCTTTCTCCTCAACCAGTGTC 

CTTGGCATCGACAGATCTGACCCTGGCTGTGCGCAT 

CACTGATTGTTGGGAAGGCCCTTGGAGAA^ 

GAGTGCGGGACGCAGCTCTGTGGATCTCGACATACCTGTGTTAGTTCCIT 

CTCCCCAGAGTGGGTGAGGACACGGCCTTTTCCCATCCTGCCCTTTCCT 

GCTTCCTTGTGGCCATCAGAGTTCCCTTCCCCTGGACAGTCTC 

GTTTGGGATTG^GACCAGACCCCATCTGAGCCCTrCCTCCAG 

GGCATGGCTGAGCTCAGACCCTCCTGATTTCT^ 

GGTGCTCACCGTGATAGGAATTTCACTCTGCATCACAAGCTCAG^^ 

AACAGTCTACCCTTTGAGTGGGCCGAACCCACTTCCAGCTCTC 

GGGCCATGAAGTGCTGGCAGTGAGCGGATGGACCTAGCACTTCCCCTCTCTGG 

CCTCCTCTCTTATGG^ATAACAGCTAC^ 

AGTTTTGTAACCTTCAAGTGCTGTTCAGC^ 

CCCTCAGCAACCTTTCTGCCCCAGCAGC^ 

GCCACCATTACTGTGGCCTGATCTGGACTATCATGGTGGCAGGTTCCATGGACTGCAGAA^ 

CCAGCTGCATGGAAAGGGCCAGCTGCAGACTTTGAGCCAGA/^TGCAAA^ 

GGACTCAGTCAGAGCGCTTTGGCTGAATGAGGGGT^ 

GTGGCAGATGCAGGAAATGAGCTGTCTATTAG 

AAATCCTCACTGCCAGCCCCTCTTAAACAGGTAGAGAGCT 

TCCAGCACACCTGGCGAGTAGTAGCrGTCAATAA^ 

AAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAA^ 
FIGURE 210

MAASLGQVLALVLVAMjWGGTQPLLKRASAGLQRVHEPTWAQQLLQEMKTLFLNTEYLMPFL 
LNQCGSLLYYLTLASTDLTLAVP I CNSLAI I FTLI VGKALGEDI GGKRKLDYCECGTQLCGS 
RHTCVSSFPEPISPEWVRTRPFPILPFPIKJLFCPLVAIRVPFPWTVWRKTEAGVWD 
FIGURE 211



CTTCTGTAGGACAGTCACCAGGCCAGATCCAG 

GAAGATGACAGC^TTATAGCAGGACCCT^ 

TTGCCAGTGGGAAGTACCTAGTGAAACGGCCTAAGA 

TGAGGCCCTGTGGTCCCCATCCTTGGGAG 

TTGCTGGTATCACTGCAGTGCTTGTTGCAGCT 

TCATGGGAAAAATCCTGTGTCAACAGCATTGCCTCTGAATGTC 

CTGTATCAGCTCCTCAGCCAGCTCCTCT 

TCTGCTCAGCGGAGAACTGCAGTGAGGAGACACACATTACAGCCTTCACT 

GCTGAAGAACACTTTCATTTTGTAAGCCAGTGCTC 

CGATGCCCTGGACCCTCCCCTGAAGAACGTGTCC^ 

AATCTAATGGAACTTCCTGTCGTGGGAAGCCCTGGAAATGCTATGAAGAAGAACAGTC 

TTTCTAGTTGCAGAACTTAAGAATGACATTGAGTCTAAGAGTCT 

CAACGTCAGTAACGCCACCTGTCAGTTC 

TCTTTCGAAAGTTTGAGTGTGCAAATC 

TCCCACAACGTGGGCTCCAAAGCTTCCCTCTACCTCTTGGCCCTTGCCAGCCTCCTT 
GGGACTGCTGCCCTGkGGTCCTGGGGCT^ 

AGGTCCAGAGCACCCCCTGCGGTGCTGACACCCTCTTTCCCTGCTCTGCCCCGTTT 

CCAGTAAGTGGGAGTCACAGGTCTCCAGGCAATGCCGACAGCTGCCTTO 

AAGCACTGGTTCATTCACTGCCAAAAAAAAAAAAAAAAAAAAAAAAAA 




FIGURE 212

MKGILVAGITAVLVAAVESLSCVQCNSWEKSCVNSIASECPSHANTSCISSSASSSLETPVR 
LYQNMFCSAENCSEETHITAFTVHVSAEEHFHFVSQCCQGKECSNTSDALDPPLKNVSSNAE 
CPACYESNGTSOTGKPWKCYEEEQCVFLVAELKNDIESKSLVLKGCSNVSNATCQFLSGENK 
TLGGVIFRKFECANVNSLTPTSAPTTSHWGSKASLYLL1AI1ASLLI1RGLLP 
FIGURE 213

GGCCTCGGTTCAAACGACCCGGTGGGTCTACAGCGGAAG<^^ 

GGGCreGCCTCACTGGCCACCCTC^ 

GCGCGCTGCTGTGGGTCCTGCTOCTCAATCTC 

ACCC^GACTCCGACCGAAATGCAGCGGGT 

CTACCGGAGCACCGCCO^ACTGGTCTTCCCCGGA^ 

ATGATGCCATGGCCGACGCCGACCGCCTGGCTGGACCAGCGGCTGCCGAGCTC^ 

ACGGTGTCCACCGGCTTTAGCCGGTCGTCCGCGATTAAC^ 

GGGGGTTGTGATTAATGCCGGAAAGGATAGCACC^^ 

ATACAGCGGGGAGTTCCAGCACX^GGTTTATAGCCAATAGTCAGGAGCCTC 

ACTTCAAGCCTGCCGCGCTCCCCCGGGAGGTCTACT^ 

CCTGAGCGAGTGGTCCACACCTGGGTCTACCCCG^^ 

CCATGCCATCTCCTGAGGATCTGCGGCTGGTGCTGATGCCCT 

TGCAAGTCGGGCACCATGAGCCX3GAGCC<^ 

TCGAGTTGGGGCGCIK»GCCAGCTC 

CCTGCAACCGACTTCGGGAAGAGTGCCCCCTGGACACAAGTCTCTGTACTGAC^ 

GCCTCTCAGAGCACCACCAGTACCAGGACC^^ 

AAGC^GTCCC^GCCTGCCACCCGCCAGCCCC^ 

GGATTGGCCTGGAGGATATTTGGAATAGCCTCTCTTCAGTGTTCACAGAGATGC^ 

GAGAGAAACCAGAGGT^TGGCGACTTCATCCACATGAGGAG 

CTTGCCCTTTCAATCCTAGCACCGACTAGAT^ 

CACAA 
FIGURE 214

MVPAAGALLWVLLLNLGPRAAGAQGLTQTPTEMQRVSLRFGGPMTRSYRSTARTGLPRKTRI 
ILEDENDAMADADRLAGPAAAELLAATVSTGFSRSSAINEEDGSSEEGWINAGKDSTSREL 
PSATPNTAGSSSTRFIANSQEPEIRLTSSLPRSPGRSTEDLPGSQATLSQWSTPGSTPSRWP 
SPSPTAMPSPEDLRLVLMPWGPWHCHCKSGTMSRSRSGKLHGLSGRLRVGALSQLRTEHKPC 
TYQQCPCNRLREECPLDTSLCTDTNCASQSTTSTRTTTTPFPTIHLRSSPSLPPASPCPALA 

FWKRVRIGLEDIWNSLSSVFTEMQP IDRNQR 
FIGURE 215






CCCGGGTCGACC<^CGCGTCC<X^^ 

CTAGCTGGGGCAGCGGCGCTGGCGAGOGGCTCCCAGGGCGACCGTGAGCC^ 

CTGCGTACTGCAGTGCGAAGAGCAGAACTGCTCTGGGGGCGCTCrc 

GCCAGCCAATCTACATGAGTCTAGCAGGCTGGACCT 

ATGTGGGTCACCGTTGGGCTCTACCTCCAGGAAGGTCAC^^ 

GTGGCX2CTTCTCCCCK3TTCCTGTTCTTTC^ 

ATGGCCTGGCCAGCCTGGTGATGCTCTGCCGCTACCGCACCTTCGT^ 

ATGTACC^CACCTGTGIWCCrrTCGCCTGGGTGTCCCT 

CTTCCACACCAGGGACACTGACCTCACAGAGAAAATG^ 

TCCTACACTC&ATCTACCTGTGCTGCGTCAGGACCGTGGGGCT 

AGTGCCTTCCGGGCTCTCCTGCTGCTCATGCTGACCGTCCACGTCT 

CCGCTTCGACTATGGCTACAACCTGGTGGCCAAGGTGGK^ 

GGTGGCTGGCCTGGTGCCTGTGGAACGAGCGGCGGCTGCCTCACGTO 

GTGGTCTTGCTGCTGCAGGGGCTGTCCCTGCTCGAGCT 

GGTCCTGGATGCCCATGCCATCTGGCACATCAGCACCATCCCTGTCCACGTCCT 
GCTTTCTGGAAGATGACAGCCTGTACCTGCTGAAGG 

TGAA GACOTTGGAGCGAGTCTGCCCCAGTGGGGATCCTGCCCCCGCCCTGOT 

CrcGCCTCAACCCTTGAGATGATTTTCTC^^ 

TGGGCCCAGAATCATGTGGCCAGCCCACCCCCTGTTGG 

CTAGGGAAGGCCTCCCAGCATCTGGGACrCGAGAGTGGG 

GAACTGGGGTGGAACTGAGTGTGTTCTTAGCTCTACCG 

CCACCAGCCTCCTCCCCACATCCCCAGCTGCCTGGCTCGGTCCTGAAGCCOT 

GGGAGACCAGGGACCACAGGCCTTAGGGATACAGGGGGTCCCCTTCTC 

CCTCCTCCAGGACACCACTAGGTGGTGCTGGATGCTTGTTCTTTGGCCAGCCAAGGTTCA 

GCGATTCTCCCCATGGGATCTTGAGGGA<X!AAGCTC 

GAGGGT-TGGGGTAGGGAGGT-TGGGAGGAGGGCTGAGGA 

CAGCAAGCCCAGGGCAAGGATCCTGTGCTGOTGTCTGGTTGAGAGCCTGCCAGCX3 

GGAGTGTGGGCCAGGCTGAGTGCATAGGTGACAGGGCCGTGAGCATGGGCCTGGGTC 

GAGCTCAGGCCTAGGTGCGCAGTGTGGAGACGGGTGTTGTCGGGGAAGAGGTGTC 

AGTGTGTGTGTGCAGGGGGTGGGTGTGTTAGCGTGC^ 

GGTGGGCATGTGAGATGAGTGACTGCCGGTGAATGTGTC 

ATGAGGGAATCCTGTCACCATCAATAATCACTTGTGGAGCGGCAGCTC 

CCTGGGCGGAGAGCCAGGAGCTCTCCATGGCCAGGCTGCGTGTG 

TGCCCCTTTGCCGGCCTCCTCCAAACCTCACA^ 

CAGCCCCTCX5GAGGCAGAGGAAGGAAAATGGGGATC 

CTCCTTGCCrrcGCATGGCnKSCCTTCCC 

TTTGCCATAGCCTGATTTTGGGGAGGAGGAAGGGGCGATTTGAGGGAGAAGGGG 

TATGGCTGGGTCTGGTTTCTTCCCTTGCCAGAGGGT^ 

CAGGCAGGGGCCACACrATGCCTGTGCCCTGGTAAAGGTCACCCCTG^ 

CCTGGCATGTTCCTGCCCCACAGGAATAGAATGGAGGGAGCTCCA 

AGGCAGTCTCCGTGGTTGAAGCAGACTCGATTTTTGCTC 

TTGAGGGAGGGGAGCTATGCTAGGACTCCAACCT 

CTTTTGATACTGAAAACTTTTAAGGTGGGAGGGTGGCAAGGGATC 

CCAAGCGTCAAAAAAAAAAAAAAAAA 



FIGURE 216



MAGLAARLVLIAGAAALASGSQGDREPVYRDCVLQCEEQNCSGGALNHFRSRQPIYM 

TCRDDCKYECMWVTVGLYLQEGHKVPQFHGKWPFSRFLFFQEPASAVMFLNGIiASLVMLC!R 

YRTFVPASSPMYHTCVAFAWVSI^AWFWSTVFHTRDTDLTEKMDYFCASTVIIJISIYLCCVR 

TVGLQHPAWSAFRALIiLMLTVHVSra^ 

RLPHWKCVVVVLLI^GLSLI^LLDFPPLFWLDAHAIWHISTIPVHVLFFSFLEDDSLYLL 
KESEDKFKLD 
FIGURE 217

GGCCGCCTOGAATTGTGGGAGTTGTO^ 
CT£2^CTCCCCAGAGCC^ 

CTGATGGCCGCCTGCTrCACCTTCTGCCrCAGTCATCA 
CAACCCAGAGAAGAGCAGCAC 

TGGATGCCGAAGTCCTGGAGGTGTTCCACCCGACGCATGAGTGGCAGGCC 
CAGKSCTGTCCCTGCAGGATCC^ 

ACTCCAATATGAGGACAAGTTCCGAT^TAATTTGAAAGGCA^ 

ACACCTACACATCTCAGGATCTCAAGAGTGCACTGGCAAAAT^ 

GAGAGTTCAAAGGAAGACAAGGCAAGGCAGGCTGAGGTAAAGCKX5CTCTT 

GGAACTGAAGAAAGACTTTGATGAGCTGAATGTTC^ 

TACGGCTGATCAACAAGTTCAATAGTTCCAGCTC 

TTTGATCTTGAATATTATGTCCATCA 

TCTTCAAGTGGTGATCAATGGGCTGAACAGCACAGAGCCCCT 
TTGTGCTGGGCGCTGCCITTTCCAGCAACCCCAAGG 
GCCCTGCAGAAGCTGCTGGTCATCCTGGCC^ 
CCIGTTTGCACTGTGCTCCCTGCTGCGC 

TCGGGGGGCTGC^GGTCCTGAGGACCCTGGTGCAGGAGAAGGGCAG 

CGCGTGGTCACACTGCTCTACXIACCTGGTCACGGAGAAGATC 

GCTGACCCAGGAGATGTCCCCAGAGAAGCTG 

GCCTGTGGGAACAGGGCTGGTGCGAGATC^^ 

GCCCGTGAGAAGGTGCTGCAGACACTGGGCGTC^ 

TCAGGACCCCCAGCTCGGCAGGACACTGGC 

GCCTGGAGCTGCAGGATGGTGAGGACGAGGGOT 

AGCTTGCTGAAGGAGCTGAGAT^GGCCGCACACC^GGACTGGA 
GGCTGAGGGGTGCCAG'CGTGGGTGGGCTO 

TGGCCATTAAATGGAAACCTGAAGGCCAAAAAAAAAAAAAAAAA^^ 
AAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAA 




FIGURE 218

MAPQSLPSSRMAPLGMLLGLLMAACFTFCLSHQNLKEFALTNPEKSSTKETERKETKAEEEL 
DAEVLEVFHPTHEWQAWPGQAVPAGSHVRIiNIiQTGEREAK1« 

TYTSQDLKSAliAKFKEGAEMESSKEDKARQAEVKRLFRPIEELKKDFDELNWIETDMQIMV 

RLINKFNSSSSSLEEKIAALFDLEYYVHQMDNAQDLLSFGGLQVVINGLNSTEPLVKEYAAF 

VLGAAFSSNPKVQVEAIEGGALQKIiLVIIiAtEQPLTAKKKVLFALCSLLRHFPYAQRQFL^ 

GGLQVLRTLVQEKGTEVLAVRVVTLLYDLVTEroiFAEEEAELTQEMSPEKLQQYRQVHIiLPG 

LWEQGWCEITAHLLALPEHDAREKVLQTLGVLLTTCRDRYRQDPQLGRTI^ 

LELQDGEDEGYFQELLGSVNSLLKELR 






FIGURE 219



TTGGGCTTCQjTAGAGGAAGTGGCGCX^ACCTTCATITGGGGTTTCGGTTCCCCCCCT^^C 

CTTCCCCGGWTCTGGGGGTGACATTG^ 
03CGC^CrCCCCAGCTGGCGCGCC<:CTCCCATTra 

TTCCCACCTCSACC^GCC^SSGGGGCTGCGGTGTTTTTCGGCTGCACTTTCGTCGC^ 
CCGGCCTTCGOGCTTTTCTTGATCACTGTGGCTGGGGACCCGCTTCGCGTTATCATCCTGGT 

cgcaggggcatttttctggctggtctccctg^^ 

TCCATGTGACCGAC CGGTCAGATGCCCGGCTCCAGT ACGG CCTCCTGATTTTTGGTGCTGCT 
GTCTCTGTCCTTCTACACK^GGTGTTCCGCTTTGCCTACTACAAGCTGCTTAAGAAGGCAGA 

TGAAGGGTTAGCATCGCTGAGTGAGGACGGAAGATCACCC^ 

ATGTTTCTGGTCTCTCCTTCGGTATCATCAGTGGTGTCTTCTCTG 

GATGCACTTGGGCCAGGTGTGGTTGGGATCCATGGAGACTCACCCTATTACTTCC^ 

AGCCTTTCTGACAGCAGCCATTATCCTGCTCCATACCTTTTGGGGAGTTGTGTTCTT^ 

CCTGTGAGAGGAGACGGTACTGGGCTTTGGGCCTGGTGGTTGGGAGTCACCTACTGACATCG 

GGACTGACATTCCTGAAGCCOTGGTATGAGGCCAGCCTGCTGCXrCATCTATGCAGTCACTGT 

TTCCATGGGGCTCTGGGCCTTCATCACAGCTGGAGGGTCCCrCCGAAGTATTCAGCGC^ 

TCTTGTGTAAGGACISACTACCTGGACTGATCGCCTGACAG^ 
CCCATGACTGAGCCCAGCCXZCAGCCCGGGTCCATTGCCCACATTCTCTGTCT 
GGTCTACCCCACTACCTCCAGGGTTTTGCTTTGTCCTTTTGTGACCGTTAGTCTCTAAGCTT 
TACCAGGAGCAGCCTGGGTTCAGCCAGTCAGTGACTGGTGGGTTTCAATCTGCACTTATCCC 
CACCACCTGGGGACCCCCTTGTTGTGTCCAGGACTCC<X;CTGTGTCAGTGCrCTGCTCTCAC 
CCTGCCCAAGACTCACCTCCCTTCCCCTCTGCAGGCCGACGGCAGGAGGACAGTCGGGTGAT 
GGTGTATTCTGCCCTGCGCATCCCACCCGAGGACTGAGGGAACCTAGGGGGGACCCCTGGGC 
CTGGGGTGCCCTCCTGATGTCCTCGCCCTGTATTTCTCCATCTCCAGTTCTGGACAGTGCAG 
GTTGCCAAGAAAAGGGACCTAGTTTAGCCATTGCCCTGGAGATGAAATTAATGGAGGCTCAA 
— GGATAC * ^^^^^^^^^ a ^^^ T ^ RAGA CTGGACATCTTGGTCTTTTTCTC 



AGG^"cTGAGGGGGAACCATTTTTGGTGTGATAAATACCCTAAACTGC<_ri 11 1 i 1 w * 1 1 1 1 
GAGGTGGGGGGAGGGAGGAGGTATATTGGAACTCTTCTAACCTCCITGGGCTATATTTTCTC 
TCCTCGAGTTGCTCCTCATGGCTGGGCTCATTTCGGTCCCTTTCTCCTTGGTCCCAGACCTT 
GGGGGAAAGGAAGGAAGTGGATGTTTGGGAACTGGCATTACTGGAACTAATGGTTTTAACCT 
CCTTAACCACCAGCATCCCTCCTCTCCCCAAGGTGAAGTGGAGGGTGCTGTGGTGAGCTGGC 
CACTCCAGAGCTGCAGTGCCACTGGAGGAGTCAGACTACCATGACATCGTAGGGAAGGAGGG 
GAGATTTTTTTGTAGTTTTTAATTGGGGTGTGGGAGGGGCGGGGAGGTTTTCTATAAACTGT 
ATCATTTTCTGCTGAGGGTGGAGTGTCCCATCCTTTTAATCAAGGTGATTGTGATTTTGACT 
AATAAAAAAGAATTTGTAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAA 

AAAAAAAAAAAAAAAAAAAAAAAAAAAA 
FIGURE 220

MGAAVFFGCTFVAFGPAFALFLITVAGDPLRVIILVAGAFFWLVSLLLASVVWFILVHVTDR 
SDARIiQYGLLIFGAAVSVliLQEVFRFAYYKLLKKADEGIjASLSEDGRSPISIRQMAYVSGLS 
FGI ISGWSVINILADALGPGVVGIHGDSPYYFLTSAFLTAAIILLHTFWGVVFFDACERRR 
YWAIX3LWGSHLLTSGLTPLNPWYEASLLPI YAVTVSMGLWAF ITAGGSLRS IQRSLLCKD 




FIGURE 221

AAGCTGGTTTAAGGAAGCAGAGGAGGGTTAGATTCGTTGAGTGAGGACXX3AAGATCAACCCA 

TTTCCATTCCGCCAGATGGCCTATGTTTCTGGTCTCTCCCTTCX3GNATCATCAG 

TOTCTGTTATCAATATTTTGGCTGATGCANTTGGG 
TCACCCTATTANTTCCTGANTTCAGCCTTTNTGACAGCAGCCATTATC 




FIGURE 222 

GACCGACCGTTCAGATGCCCGGTTCCAGTACGGCTTCCTGATTTTTGGTGCTGCTGTNTCTG 

TCCTTCTACAGGAGGTGTTCCGCTTTGCCTAN^ 

TTAGCATNGCTGAGTGAGGACGGAAGATCACCCATO 

TGGTNTTTCCTTCGGTATCATCAGTGGTGTTTTNT 

TTGGGCCAGGTGTGGTTGGGATCCATGGAGANT^ 

NTGACAGCAGCGATTATCCTGNTCCATACCTT^ 

GAGGAG 




FIGURE 223

NGTTGGAGAAGTGGCGCGGACNTTCATTTGGGGTTTCGGTTTCCCCCCTTTCCCTTTCCCCG 

GGGTCTGGGGTGACATTGCACGGGCCCCTCGTGGGG 

CCAGNTGGNGC^CCTTCCCATTTGCCTGTCCTGGTCAGGCCCCCACOT^ 

ACCAGCGATGGGGGCTGCGGTGTTTTTCGGCIX3CACTTTCGTCGCGTTCG 

CGCTTTTCTTGATCaCTGTGGCTGGGGACCCGCTTCGCGTTAT^ 

TTTTTCTGXSCTGGTCTCCCTGCTCCTGGCCTC^ 

CGACCGGTCAGATGCCCGGCTCCAGTACGGCCTCCTC 

TTCTACAGGAGGTGTTCCGCTTTGCCTACTACAAGCTGCTTAAGAAGGCAGATC 

GCATCX3CTGAGTGAGGACGGAAGATCACCCATCTC 

TCTCTCCTTCGGTATCATCAGTOSTGTCTTCTCTGTTA^ 

GGCCAGGTGTGGTTGGGATCCATGGAGACTCACCC 




FIGURE 224 

GTAAAAGAAAGTGGCCGGACCTC 

TCTGGGGGTGACATTGCACCGCGCCCNTCGTGGGGTaSCGTTGC 

CAGNTGGCGCGCCCCTCCC^TTTC 

CCAGCCATGGGGGCTGCGGTGTTTTTCte 

GCGCTTTTCTTGATCACTGTGGCTGGGGAC^ 

ATTTTTCTGGCTGGTCTCCCTGCTCCTG^ 

CCGACCGGTCAGATGCCCGGCTCCAGTACGGCCT 

CTTCTACAGGAGGTGTTCCGCTTTGCCTACT^ 

AGCATCGCTGAGTGAGGACGGAAGATCACCCATCTCCATCCGCCAGATGGCCTATGTTTCTG 

GTCTCTCCTTCGGTATCATCAGTGGTGTCTTCTCTGTTATCAAT^ 

GGGCCAGGTGTGGTTGGGATCCATGGAGAC 






FIGURE 225



GCCCCAGGGAGCAGTGGGTGGTTATAACTCAGGCCCGGTGCCCAGAGCCCAGGAGGAGGCAG 
TGGCCAGGAAGGCACAGGCCTGAGAAGTCT^ 

TACCTGGGGGACAGGGCT^GTGAGACCTGGTGAGGGTGGCTCAGCAGGCAGGGAAGGAGAGG 
TGTOTGTGCGTCCTGCACCCACATCTTTCTOTGTCCCCTCCTTGCCCrGTCTGGAGGCTGCT 
AGACTCCTATCTTCTGAATTCTATAGTGCCTGGGTCTCAGCGCAGTGCCX3ATGGTGGCCOG 

CCTTGTGGTTCCTCTCTACCT^^ 

CCTGGATGTGGGTGCTCTGTGCTCTGATCACAGCCTTGCTTCTGGGGGTCACAGAG 

CTCGCCAACAATGATGTTTCCTGTGACCACCCCTCTAACA^ 

GQACCTOGfflkGCTGGG«XM^^ 

ATGGATCCGACTGCGATATGCACACCCAGCCGTGGCAGGCCGCGCrGTTC 
CAGCTCTACTGCGGGGCX^TGTTGGTGCATCCACAGTGGCTGCT 

GAAGAAAGTTTTCAGAGTCCGTCTCGGCCACTACTCCCTGTCACCAGTTTATGAATCTGGGC 
AGCAGATGTTCCA(3GGGGTCAAATCCATCCCCCACCCTGGCTACTC^ 

AACGACCTCATGCTCATCAAACTGAACAGAAGAATTCGTCCCACTAAAGATGTCAGACCCAT 
O^CGTCTCCTCTCATTGTCCCTCTGCTGGGACAAAGTGCTTGGTGTCTGGCTGGGGGACAA 
CCAAGAGCCCCCAAGTGCACTTCCCTAAGGTCCTCCAGTGCTTGAATATCAGCGTGCTAAGT 

-CAGAAAAGGTGCGAG^^^ 
CAAAGCAGGTAGAGACTCCTGCCAGGGTGATTCTGGGGGGCCTGTGGTCTGCAATGGCTCCC 

TGCAGGGACTCGTGTCCTGGGGAGATTACCCTTGTGCCCGGCCCAACAGACCX3GGTGTCTAC 

ACGAACCTCTGCAAGTTCACCAAGTGGATCCAGGAAACCATCCAGGCCAACrCCTGAGTCAT 

CCCACX3ACTCAGCACACCGGC^TCCCCACCTGCTGCAGGGACAGCCCTGACACTCCTTTCAG 

ACCCTCATTCCTTCCCAGAGATGTTGAGAATGTTCATCTCTCCAGCCCCTGACCCCATGTCT 

CCTGGACTCAGGGTCTGCTTCCCCCACATTGGGCTGACOTTGTCTCTCTAGT^ 

GAAC^TTTCCAAAACTGTCCAGGGCGGGGGTTGCGTCTCAATCTCCCTGGGGCACTTTCAT 

CCTCAAGCTCAGGGCCCATCCCTTCTCTGCAGCTCTGACCCAAATTTAGTCCCAGAAATAA^ 

CTGAGAAGTGGAAAAAAAAA 




FIGURE 226 

MATARPPWMWVLCALITALLLGV^ 
SSSRIINGSDCDMHTQPWQAALLLRPNQLYCGA^ 

PVYESGQQMFQGVKS I PHPGYSHPGHSNDLMLI KLNRR IRPTKDVRPINVSSHCPSAGTKCL 
VSGWGTTKSPQVHFPKVLQCLNISVLSQKRCEDAYPR 
WCNGSLQGLVSWGDYPCARPNRPGVYTNLCKFTKWI QET I QANS 






FIGURE 227



TGCTGGCTGTAG< 
GCGCCCCCACCTC 
AAGGGCGGACAG< 



TGCTGGCTGTAOT^ 
GCGCCCCCACCTGTCG1 



^GCCCCCACCTGTCGTCAGCACTGGGGCTGCCAGCGC 



ACTOCTCCGTG^&CTTCTT^GQGGCTGGQACG0GTACC6AQACGQCTTTGGCAGGCTCACC 



SSScgtcScctc^^ 



_CCCTCCCAAT.GCCAACCCCCACC^m^ 





aKXKSCTGCCATCCCATTTCTC 



GGGCCTCAGCCTGG CT3TCGGCTGGG TGj3C 



rAnCTGCTGCCTTTGGGGACCCACGCTCCAAGGCTGAGACCAGTTCCCIK^AGGCCACCC^ 



rrACTCrac^TGGG^GCTCT^ 
SStCTACAA^ 
GCCTTGGGGGTGATG^ 

SSgaggtgag^ 

tcSSgctgtctgc^ 
^^tScc^ggcctcctgatgg^gotccgaaa 

^SccSggg^tgtggggttccgttctcccttcccctcccactgaagtttgtgctta^ 
tgtcccagtgccaccaggtcatccacatgogcag 
FIGURE 228

MVNDRWKTMGGAAQLEDRPRDKPQR^ 

APPP WSTGAASANS ALVTVERADSSHLS I LI DPRCPDLTDS FARLES AQAS VLQALTEHQA 

QPRLVGDQEQELIJ)TLADQLPRLIiARASELQTECMGLRKGHGTLGQGLSAW 

SESQGHMAHLWSVSDILDALQRDRGLGRPRNKADLQRAPARGTRPRGCATC 

IiSGQQDDGVYSVFPTHYPAGPQVYCDMRTDGGGWTVFQRREDGSVNF 

GEHWLGLKRIHALTTQAAYEIiHVDLEDFENGTAYARYGSFGVGLFSVDPEETC 

GTAGDSLLKHSGMRFTTKDRDSDHSEttiNCAAFYRGAW^ 

VEWSSWTGWQYSLKFSEMKIRPVREDR 






FIGURE 229

GCAGTCAGAGACTTCCCCT^ 

TGCTTCCTGAACTAGCTCACAGTAGCCCGGCGGCCCAGGGC^^ 
CACCGCTGTAGGAATCCAGAJ^CAGGCCAAGTACAGCAGCACGAGGGACATGCT 

ATGGGGACACCACCATGAGCCTGCATTCTCAAGCCTCTGCCACAACT 
CGQQGCACAGAGCAC1AGGGCTCCCTCTTCAACGTGG0GACCAGTGGCCCTGACCCTGCTGAC 

TTTGTGCTTGGTGCTGCTGATAGGGCTGGCAGCCCTGGGGCTTTTC 
AGCTCTCCAATACTGGTCAAGACACCATTTCrC^AATGGAAGAAA^ 

CAAGAGTTGCAATCTCTTCAAGTCCAGAA^ 

TGAAAAACTCTGTCGTGAGCTGTATAAC^^ 

^CAATGGAAATGGCATGGAGACAATTGCTACCAG 

GACTGTAAATATTTCTGCCTTAGTGAAAACTCTACCATGCTGAAGATAAACAA^ 
CCTGC^TTTG<X:GCGTCTCAGAGCTACTCTGAGTTTTTCTACrrCTTATTGGACAGGGCTTT 

TGCGCCCTGACAGTGGCAAGGCCTGGCTGTGGATGGATGGAACCCCTTTCACTTCT 
TTCCATATTATAATAGATGTCACCAGCCCAAGAAGCAGAGACTGTGTGGCCATCCTCAATGG 
GATGATCTTCTCAAAGGACTGCAAAGAATTGAACKX3TTGTGTCTGTGAGAGAAGGGCAGGAA 
TGGTGAAGCCAC^GAGCCTCCATGTCC<:CCCTGAAACATTAGGCGAAGGTGACTG&TTGGCC 



TGGGAAATGGAACATAATCAGGAAAGACTATCTCTCTGACTAGTACAAAATGGGTTCTCGTG 
TTTCCTGTTCAGGATCACCAGCATTTCTGAGCTTG^ 

AGAAGTCTTATTTAC^TGCCACCAACCAACCTC^GAAACCCATAATGTCATCTGCCT^ 
GCTTAGAGATAACITTTAGCTCTCTTTCTTCTCAATGTCTAATATCACCTCCCTGTrrTCAT 

GTCTTCCTTACACTTGGTGGAATAAGAAACTTTTTGAAGTAG 
ATCCTTTTCTCTGACAGTGAAGTAGTCCATCAGAAATTGGCAG 
AGCAAATACACAAGGAATTCTTTTTGTTTGTTTCAGTTCATACTAGTCCCT 
CAGTAAAGACCCCATCTGCCTTGTCCATGCGGTTTCCCAA<^GCX3ATGTCACTTGATATGAG 

AATCTCAAATCTCAATGCCTTATAAGCATTCCTTCCTGTGTCCATTAAG 

TCTCCCCTCCATAGGAATTTCTCXZCAGGAAAGAAATATATCCGCATCTCCG 

AACTACCGTCCCCGATATTCCCTTCAGAGAGATTAAAGACCAGAAAAAAGTGAGCCTCTTC^ 

TCTGCACCTGTAATAGTTTCy^TTCCTATTTTCTTCCATTGACCCATATTTATA 
GTACTGAAGATTTAATAATAATAAATGTAAATACTGTGAAAAA 




FIGURE 230

MQAKYSSTRDMLDDDGDTTMSLHSQASATTRHPEPRRTEHRAPSSTWRPVALTLLTLCLVLL 
IGIJU^LLFFQYYQLSNTGQDTISQMEERIXSNTSQEI^SLQVQNIKIAGSLQHVAEKLCRE 
LYNKAGAHRCSPCTEQWKWHGDNCYQFYKDSKSWEDCKYFCLSENSTMLKINKQEDLEPAAS 
QSYSEPFYSYWTGLLRPDSGKAWLWMDGTPFTSELPHI I IDVTSPRSRDCVAILNGMIFSKD 
CKELKRCVCERRAGMVKPESLHVPPETLGEGD 




FIGURE 231 

AATTTTCACCGCTGTAGGAATCCAGATGCA^ 

ATGATGATGGGACACCACCATGAGCCTGCATTNTCAAGCTTT 

AGCCCCGGCGCACAGAGCACAGGGNTCCTTTTTCAACGTGGCGACX1AG 

CTGACTTTGTGCTTGGTGCTGCTGATAGGG^^ 

CTACCAGCTCTCCAATACTGGTCAAGACACC^TTTCTCAAATGGAAGAAAGATTAGGAAATA 
CGTCCCAAGAGTTGCAATTTNTTCAAGTCCAGAATATAAAGCTTGCAGGAAGTOT 
GTGGCTGAAAAACTCTGTCGTGAGCT<3TATAACAAAGCTGGA<3GAACTTTC 
AGTNTCCTCATNTACTATACACACACCACTTCCC 




FIGURE 232

GCCGAGCGCAAGAACCCTGCGCAGCCCAGAGCAGCTGCTGGAGGGGAATCGAGGCGCGGCT^ 
CGGGGATTCGGCTCGGGCCGCTGGCTCTGCTCTGCGGGGAGGGAGCGGGCCCGCCCGCGGGG 
CCCGAGCCCTCCGGATCCGCCCCCTCCCCGGTCCCGCCCCCTCGGAGACTCCrCTGGCT 
CTGGGGGTTCGCCGGGGCCGGGGACCCGCGGTCCGGGC^ 

TCGGTGCTGCGGCCCGCAGGGCCCGTGGCCGTGGGCATCTCCCTGGGCTTCACCCTGAGCCT 
GCTCAGCGTCACCTGGGTGGAGGAGCCGTGCGGC^ 

AGCTGCCGCCGCGCGGCAACACCAACGCGGCGCGCCGGCCCAACTCGGTGCAGCCCGGAGCG 
GAGCGCGAGAAGCCCGGGGCCGGCGAAGGCGCCGGGGAGAATTGGGAGCCGCGCGTCTTGCC 
CTACCACCCTGCACAGCCCGGCCAGGCCGCCAAAAAGGCCGTCAG 
CGGAGCTGGGCATCAGGCAGAGGCTGCTGGTGGCGGTGCTGACCT 

ACGCTGGGCGTGGCCGTGAACCGCACGCTGGGGCACCGGCTGGAGCGTGTGGTGTTCCTGAC 
GGGCGCACGGGGCCGCCGGGCCCCACCTGGCATGGCAGTGGTGACGCTGGGCGAGGAGCGAC 
CCATTGGACACCTGC^CCTGGCGCTGCGCCACCTGCTGGAGCAGCACGGCGACGACTTTGAC 

TGGTTCTTCCTGGTGCCTGACACCACCTACACCGAGGCQCACGGCCI^ 
CCACCTCAGCCTGGCCTCCGCCGCCCACCTGTACCTGGGCCGGCCCCAGGACTTCATCGGCG 

GAGAGCCCACCCCCGGttXXrTA<TOCCAC^^ 

CTGCAACAACTGCGCCCCCACCrGGAAGGCTGCCGC/VACGAC^TCGTCAGTGCGCGCCCrGA 
CGAGTGGCTGGGTCGCTGCATTCTCGATGCCACCGGGGTGGGCTGCACTGGTGACCACGAGG 
GGGTGCACTATAGCCATCTGGAGCTGAGCCCTGGGGAGCCAGTGCAGGAGGGGGACCCTCAT 
TTCCGAAGTGCCCTGACAGCCCACCCTGTGCGTGACCCTC 

AGCTTTCGCCCGAGCTGAACTGGAACGCACGTACCAGGAGATCCAGGAGTTACAGTGGGAGA 
TCCAGAATACCAGCCATCTGGCCGTTGATGGGGACCGGGCAGCTGCTTGGCCCGTGGGTATT 

CCAGCACCATCCCGCCCXX3CCTCCCG<^T^^ 
GCACGCTTTCTCCTGCGCCK5ATGGCTCACCCCGCTGCCCACTGCGT 

ATGTGGCCGATGTTCTGGGGACAGCTCTAGAGGAGCTGAACCGCCGCTACCACCCGGCCTTG 
CGGCTCCAGAAGCAGCAGCTGGTGAATGGCTACCGACGCTTTGATCCGGCCCGGGGTATGGA 
ATACACGCTGGACTTGCAGCTGGAGGCACTGACCCCCCAGGGAGGCCGCCGGCCCCTCACTC 
GCCGAGTGCAGCTGCTCCGGCCGCTGAGCCGCGTGGAGATCTTGCCTGTGCCCTATGTCACT 
GAGGCCTCACGTCTCACTGTGCTGCTGCCTCTAGCTGCGGCTGAGCGTGACCTGGCCCCT^ 
CTTCTTGGAGGCCTTTGCCACTGCAGCACTGGAGCCTGGTGATGCTGCGGCAGCCCTGACCC 
TGCTGCTACTGTATGAGCCGCGCCAGGCCCAGCGCGTGGCCCATGCAGATGTCTTCGCACCT 
GTCAAGGCCCACGTGGCAGAGCTGGAGCGGCGTTTCCCCGGTGCCCGGGTGCCATGGCTCAG 
TGTGCAGACAGCCGCACCCTCACCACTGCGCCTC^ 

TGGACACACTGTTCCTGCTGGCCGGGCC^GAC^CGGTGCTCACGCCTGACrTCCTGAACCG^ 
TGCCGCATGCATGCCATCTCCGGCTGGCAGGCCT 

CCCAGGTGTGGCCCCACCACAAGGGCCTGGGCCCCCAGAGCTGGGCCGTGACACTGGCCGCT 

TTGATCGCCAGGCAGCCAGCGAGGCCTGCTTCTACAACTCCGACTACX3TGGCAGCCCGTGGG 

CGCCTGGCGGCAGCCTCAGAACAAGAAGAGGAGCTGCTGGAGAGCCTGGATGTGTACGAGCT 

GTTCCTCCACTTCTCCAGTCTGC^TGTGCTGCGGGCGGTGGAGCCGGCGCTGCTGCAGCG 

ACCGGGCCCAGACGTGCAGCGCGAGGCTCAGTGAGGACCTGTACCACCGCTGCCTCCAGAGC 

GTGCTTGAGGGCCTCGGCTCCCGAACCCAGCTGGCCATGCTACTCTTTGAACAGGAGCAGGG 

C^CAGCACCT^CCCCACCCTGTCCCCGTGGGCCGTGGC^TGGCC^CACCCCACCCCACT^ 

CTCCCCC7VAAACCAGAGCCACCTGCCAGCCTCGCTGGGCAGGGCTGGCCGTAGCCAGACCCC 

AAGCTGGCCCACTGGTCCCCTCTCTGGCTCTGT^ 

GGACGTGCCCCCAGAGCCACCC^CTTCTCATCCCAAACCCAGTTTCCCTGCCCCCTGACGCT 

GCTGATTCGGGCTGTGGCCTCCACGTATTTATGCAGTACAGTCTGCCTGACGCCAGCCCTGC 

CTCTGGGCCCTGGGGGCTGGGCTGTAGAAGAGTTGTTGGGGAAGGAGGGAGCTGAGGAGGGG 

GCATCTCCGAACTTCTCCCTTTTGGACCCTGCCGAAGCTCCCrGCCT 

AGTGTGGAAAAA 

FIGURE 233

MRASUiSVIJIPAGPVAVGISLGFTLSIiSVTWVEEPCGPGPPQPGDSELPPR 
NSVQPGAEREKPGAGEGAGENWEPRVLPYHPAQPGQAAKKAVRTRYISTELGIRQRLLVAVL 

TSQTTLPTLGVAVNRTLGHRLBRVVPLTGARGRRAP PGMAVVTLGEERPI-GHLHLALRHIjLE 
QHGDDFI)WFTLVPDTTYTEAHGIiARLTGHLSIiASAAHL^ 

vi^rmiaqqi^phlegcrndivsarpdew^^^ 

VQEGDPHFRSALTAHPVRDPVHMYQLHKAFARAELERTYQEIQELQWEIQNTSHLAVDG 
AAWPVGIPAPSRPASRFEVLRWDYFTEQHAFSCADGSPRCPLRGADRADVADVLGTALEELN 

RR YHP ALRLQKQQLVNGYRRFD PARGME YTLDLQLEALT PQGGRRPLTRRVQLLRP LSRVE I 
LPVPYVTEASRLTVIiLPLAAAERDLAPGFI^ 

HADVF APVKAHVAELERRF PGARVPWLSVQTAAP S P LRLMDLLS KKHP LDTLFLLAGPDTVL 
TPDFLNRCRMHAISGWQAFFPMHFQAFHPGVAPPQGPGPPELGRDTGRFDRQAASEACFYNS 

DYVAARGRLAAASEQEEELLESLDVYELFLHFSSLH^RAVEP 
YHRCLQSVLEGLGSRTQLAMLLFEQEQGNST 
FIGURE 234

GCTCTGGCCGGCCCCGGCGATTGGTCACCGCCCGCTAGGGGACAGCCCTGGCCTCCTCTGAT 

TGGCAAGCGCTGGCCACCTCCCCACACCCCTTC 

AGCTATTAGCCAATTCGGCAGGGCCCGCTTTT^ 

ACTAGCGGAAGCTCTGCCTCTTTCCCGAGTGG^ 

CTGTATCCACCCAAATGTCACCGATTTCTT^ 

GAAATTTCTCAGCCTGGCCGAAAATGGTTGGCCCCACGAAGCCACGACAACT 
AGGGTTGCTCAACGCCCCGCCTCATTGGAAAACCAAATCAGATCTGG 
GCGGAGGCGGGGCGATGATTGTCGCGCTCGCACCCACT 
TTCCCCGCCCCTGAGACCCTGCAGCACGkTCTGTC^ 

GCTCGCCGTCTTTTGGCX3GCAGCGGCGACGCGAGGGCTCCCGGCCGCCCGTOTCCGCTGGGA 

ATCTAGCTTCTCCAGGACTGTGGTCGCCCCGTCCGCTGTGGCGGGAAAGCGGCCCCCAGAAC 

CGACCACACCGTGGC^GAGGACCCAGAACCCGAGGACGAAAACTTGTATGAGAAGAACC^ 

GACTCCCATGGTTATGACAAGGACCCCGTTTTGGACGTCTGGAACATGCGACT^ 

CTTTGGCGTCTCGATCATCCTGGTCCTT^ 

GGATGAAAGAGTGGTCCCGCCGCGAAGCTGAGAGGCTTGTGAAATACCGAGAGGCCAATGGC 
CTTCCCATCATGGAATCC^CTGCTT^ 

ACCAGTTGCTAAGTGGGGCTCAAGAAGCACCGCCTTCCCCACCCCCTGCCTGCCAT^ 
CTCTTOTCAGAGCACCTAATTAAAGGGGCTGAAAGTCTGAA 



FIGURE 235

MAAGLFGL S ARRLLAAAATRGLP AARVRWES S FS RTWAP S AVAGKRP P E PTTP WQED PE PE 
DENLYEKNPDSHGYDKDPVLDVWNMRLVFFFGVS I ILVLGSTFVAYLPDYRMKEWSRREAER 
LVKYREANGLPIMESNCFDPSKIQLPEDE 




FIGURE 236

GGCGGCTGGGCTGTTTGGTTTGAGCGCTCGCCGTCTTTTGGCGGCAGCGGCGACGCGAGGGC 
TCCCGGCCGCCCGCGTCCGCTGGGAATCTAGCTTCTCCAGGACTGTGGTCGCCCCGTCCGCT 
GTGGCGGGAAAGCGGCCCCCAGAACCGACCACACCGTGGCAAGAGGACCCAGAACCCGAGGA 
CGAAAACTTGTATGA6AAGAACCCAGACTCCCATGGTTATGACAAGGACCCCGTTTTGGACG 
TCTGGAACATGCGACTTGTCTTCTTCTTTGGCGTCT^ 

TTTGTGGCCTATCTGCCTGACTACAGGATGAAAGAGTGGTCCCGCCGCGAAGCTGAGAGGCT 

TGTGAAATACCGAGAGGCCAATGGCCTTCCCATCATGGAATC^ 

AGATCCAG 




FIGURE 237

GCGGCGGCT^CCGCTTGCTCTGCTCGTCCTGTTGCTCCTGGGGCCOGGCGGCTGGTGCCT 
TGCAGAACCCCCACGCGAC^GCCTGCGG<^GGAACTTGTCATCACCCCGCTGCCTTCCGGGG 

ACGTAGCCX3C<^CATTCCAGTTCCGCACGCG^^ 
TCCCATTACAGGCTCTTTCCCAAAGCCCTGGGGCA 

GCTGCACCTGTCATTCACACAAGGCTTTTGGAGGACCCGATACTGGQGGCCACCCTTCCTGC 
AGGCCCCATCAGGTGCAGAGCTGTGGGTCrGGTTCCAAGACACTGTCACTGATGTGGATAAA 

TCTTGGAAGGAGCTCAGTAATGTCCTCTCAGGGATCTTCTGCGCCTCTCTCAAOT 
CTCCACCAACACAGTCACTCCCACTGCCTCCTTCAAACCCCTGGGTCTGGCCAATGA^ 
ACCACTACTITTCTGCGCTATGCTGTGCTGCCGCGGGAGGTGGTCrGCACCXSAAAACCT 
CCCTGGAAGAAGCTCTTGCCCTGTAGTTCCAAGGC^GGCCTCT 

TCGCTTGTTCCACACCAGCTACCACTCC^GGCAGTGCATATCCGCCCTGTTTGCAGAAATG 
CACGCTGTACTAGCATCTCCT<3GGAGCTGAGGCAGACCCTGTCAGTTGTATTTGATGCCTTC 
ATC^CGGGGCAGGGAAAGAAAGACTGGTCCCTCTTCCGGATGTTCTCCCGAACCCrCACGGA 
GCCCTGCCCCCTGGCTTC^GAGAGCC!GAGTCTATGTGGACATCACCACCTACAACCAGGACA 
ACGAGACATTAGAGGTGCACCCACCCCCGACCACTACATATCAGGACGTCATCCTAGGCACT 
CGGAAGACCTATGCCATCTATGACTTGCTTGACACCGCCATGATCAACAACTCTCX3AAACCT 
CAACATCCAGCTGAAGTGGAAGAGACCCCGA<1AGAATGAGG^ 

ATGCCCAGCGGTACGTGAGTGGCTATGGGCTGCAGAAGGGGGAGCTGAGCACACTGCTGTAC 

AACACCCACCCATACCGGGCCTTCCCGGTGCTGCTGCTGGAC 
GCTGTATGTGCACACCCTCACC^TCACCTCCAAGGGCA^ 

ACTACCAGCCTGCCCAGGACCKjGCTGCAACCCCACCTCCTGGAGATGCrrGATTCAGCTGCCG 

GCCAACTCAGTCACCAAGGTTTCCATCC^GTTTGAGCGGGCGCTGCTGAAGTGGACCGAGTA 

CACGCCAGATCCTAACCATGGCTTCTATGTCAGCCCATCTGTCCTCAGCGCCCTTGTGCCCA 

GCATGGTAGCAGCCAAGCCAGTGGACTGGGAAGAGAGTCCCCTCTTCAA<^GCCTGTTCCCA 

f3TCTgrQATGGCT CTAACTACTTTGTGCGGCTCTACAOGGAGCCGCTGCTGGTGAA CCTGCC 

GACACCGGACTTCAGCATGCCCTACAACOTGATCTGCCTCACGTGCACTGTGGTGGC 

GCTACGGCTCCTTCTACAATCTCCTCA<X:CGAACCTTCCACATCGAGGAGCCCCGCACAGGT 

GGCCTGGCCAAGCGGCTGGCO^CCTTATCCGGCGCGCCCGAGGTGTCCCCC^CTCl^TT 

CTTGCCCTTTCCAGCAGCTGCAGCTGCCGTTTCTCTCTGGGGAGGGGAGCCCAAGGGCTGTT 

TCTGCCACTTGCTCTCCTCAGAGTTGGCTTTTGAACCAAAGTGCCCTGGACCAGGT 

CTACAGCTGTGTTGTCCAGTAC^GGAGCCAC<^GCCAAATGTGGCATTTGAATTTGAATTAA 

CTTAGAAATTC^TTTCCTCACCTGTAGTGGCCACCTCTATATTGAGGTCCTC^TAAGa^ 

AGTGGTCGGTGGCTGCTGTATT<^ACAGCACAGAAAAAGATTTCCATCACCACAGAAAGGTC 

GGCTGGCAGCACTGGCCAAGGTGATGGGGTGTGCTACACAGTGT^ 
TGGAGTTTACTGTTTGTGGAATAAAAACGGCTGTTTCCGTGGAAAAAAAAAAAA 




FIGURE 238

MPLALLVLLLLGPGGWCLAEPPRDSLREELVITPLPSGDViATFQFRTRWDSBLQREGVSHY 
RLFPKALGQLISKYSLRELHLSFTC^FWRTRYWGPPFLQAPSGAELWVWPQDTVTDVDKSWK 
ELSNVLSGIFCy^LNFIDSTNTVTPTASFlCPLGIiANDTO 

KLLPCSSKAGLSVLLKADRLFHTSYHSQAVHIRPVCRNARCTSISWELRQTLSVVFDAFITG 
QGKKDWSLFRMFSRTLTEPCPLASESRVYVDITTYNQDNETLEVHPPPTTTYQDVILGTRKT 
YAIYDLLDTAMI1TOSRNLNIQLKWKRPPENEAPPVPFLHAQRYVSGYGLQKGELSTLLYNTH 
PYRAFP VLLLDTVPWYIiRLYVHTLTI TS KGKENKPS YIHYQPAQDRLQPHLLEML I QLPANS 
VTKVSIQFERALLKWTEYTPDPNHGFYVSPSVLSALVPSMVAAKPVDWEESPLFNSLFPVSD 
GSNYFVRLYTEPIiWLPTPDFSMPYNVICLTCTWAVOTGSFYNLLTRTFHIEEPRTGGLA 

KRliANLIRRARGVPPL 




FIGURE 239

CAAC^TGGGGTCCA.GCAGCTTCTTGGTCCTCATGGTGTCTOT 

CTGTGGAAGGAGTTAAAGAGGGTATAGAGAAAGCAGGGGTTTGCCCAGCTGACAACGTACGC 

TGCTTCAAGTCCGATCCTCCCCAGTGTCACACAGACC^GGACTGTCTGGGGGAAAGGAAGTG 

TTGTTAGCTGCACTGTGGCTTCAAGTGTGTGATTCCTGTGAAGGAACTGGAAGAAGGAGGAA 

ACAAGGATGAAGATGTGTCAAGGCCATACCCTGAGCCAGGATGGGAGGCGAAGTGTCCAGGC 

TCCTCCTCTACCAGGTGTCCTC^GAAATGATGCTGGGTCCTTTCTACCTCTGGGGGTCACTC 

TCACTTGGCACCTGCCCCTGAGGGTCCTGAGACTTGGAAT^ 

CCACCAAAGAAAACCTGAGCTTGAAGTCCTTTTCCCCAAAAA 

TCCAGACCCCAGGGACGGTACTTTCCCTCTCTACCrGGTGCTCCrCCCTAA 

GGACCCCTCATGAATGAAACCAGTGCCCTTATAAGAGACCCCAAAGAGCTGCCT^^ 

TGCAATGTGTGATCACAGCTAGAAGGCACTGTCAGAGAAGAGAAACTGGTCCTCACCAGATG 

CTGAATCTGCTGGTGCCTTGATCTTGGACTTCCCAGCCrrCTAGAACTGTAAGAAATAAA 

TTGCTGTTTATAATCCAA 






FIGURE 240

MGSSSFLVLMVSLVLVTLVAVEGVKEQIEKAGVCPADNVRCFKSDPPQCHTDQDCLGERKCC 
YLHCGFKCVr P VKELEEGGNKDEDVSRP YPEPGWEAKCPGS S STRCPQK 
FIGURE 241

AAACTC^GGACTTGCCGGAGTGGCTCATT 

AAACCTGAGCGGTGAGACTCCCAGCTGCCTACATCAAGGCCCCAGGACATGCAG 

T ,~rar;& a rrrr!arrrACCACCATQAGGTCCTGCCTGTGGAGATGCA<X3CACCTGAGCCAAGG 

CGTCCAGTGGTCCTTGCTTCTGGCTGTCCTGGTCTTC^ 

TTAAGGAGCCTCAAACAAAGCCTTCCAGGCATCAACX3CACAGA 

CTACAGTCCCTGGCAAAGCCTAAGTCCCA<^CACCCACAAGGGCGAGGAGGAG^CC^TCTA 

TGCAGAGCCAGCGCCA6AGAACAATGCCCTCAACACACAAACCCAGCCCAAGGCCCA 

CCGGAGACAGAGGAAAGGAGGCCAACCAGGCACCGCCGGAGGAGCAGGACAAGGTGCCCXAC 

ACAGCACAGAGGGCAGCATGGAAGAGCCCAGAAAAAGAGAAAACCA^ 

ACCCAGAGGGCAAGATGCAGGGATGGCCTCTGGCAGGACAGAGGCACAATCATGGAAGAGCC 
AGGACACAAAGACGACCC^GGAAATGGGGGCCAGACCAGGAAGCTGACGGCCTCCAGGACG 
GTGTCAGAGAAGCACCAGGGCAAAGCGGCAACCACAGCCAAGACGCTCATTCCCAAAAGTCA 
GCACAGAATGCTGGCTCCCACAGGAGC^TGTCAAC^GGACGAGACAGAAAGGAGTGACCA 
CAGCAGTCATCCCACCTAAGGAGAAGAAACCTCAGGCCACCCCACCCC!CTGCCCCTTTCCAG 
AGCCCCACGACGCAGAGAAACCAAAGACTCAAGGC^ 

GGATTTTGAGGAAAAATACAGCTT(X5AAATAGGAGGCCTTCAGACGACTTGCCCTGACTCTG 

tgaagatcaaagcctccaagtcgotgtggctccagaaactctttctgcccaacctcactctc 
ttcctggactcc^gacacttgaaccagagtgagtgggaccgcctggaacactttgcaccacc 
ctttggcttcatggagctcaactactccttggtgcagaaggtcgtgacacx3cttccct 
tgccccag<^cagctgctcxtggccagcc^ 

gccgtggtgggcaackxsgggc^tcctgaacaactcccacatgggccaggagatagacagtca 

cgactacx3tgttccgattgagcggagctctcattaaaggctacgaacaggatgtggggactc 

ggacatccttctacxmctttaccgccttctccctgacccagtcactccttatattgggcaat 

cggggtttcaagaacgtgcctcttgggaaggacgtccgctacttgcacitcctggaaggc^ 

ccgggactatgagtggctggaagcactgcttatgaatcagacggtgatgtcaaaaaaccttt 

tctggttcaggcac^gaccccaggaagcttttcgggaagccctgcacatggacaggtacx:tg 

ttgctgcaccca<5actttctccgatacatgaagaacaggtttctgaggtctaagaccctgga 

tggtgcccactggaggatataccgccccac<^ctggggccctcctgctgctcactgcccttc 

agctctgtgaccaggtgagtgcttatggcttcatcactgagggccatgagcgcttttctgat 

cactactatgatacatcatggaagcggctgatcttttaca^ 
gagagaagtctggaagcggctacacgatgaagggataatccxkxttgtaccagcxstcctggtc 

rvrena a nvypa A &nrcAAGAACTSA<^OGGGGCCAGGGCTGCCATGGTCTCCTrGCCrGCTC 

CAAGGCACAGGATACAGTGGGAATCTTGAGACTCTTTGGCCATTTCCCATGGCTCAGACT 

GCTCCAAGCCCTTCAGGAGTTCCAAGGGAACACTTGAACCATGGACAAGACTCTCTCAAGAT 

GGCAAATGGCTAATTGAGGTTCTGAAGTTCTTCAGTACATTGCTGTAGGTCCTGAGGCCAGG 

GATTTTTAATTAAATGGGGTGATGGGTGGCCAATACCAC^TTCCTGCTGAAAAACACTCTT 

CCAGTCCAAAAGCTTCTTGATACAGAAAAAAGAGCCTGGATTTACAGAAACATATAGATCTG 

GTTTGAATTCCAGATCGAGTTTACAGTTGTGAAATCTTGAAGGTATTACTTAACTTCACTAC 

AGATTGTCTAGAAGACCTTTCrAGGAGTTATCTGATTCTAGAAGGGTCTATACTTGTCCTTG 

TCTTTAAGCTATTTGACAACTCTACGTGTTGTAGAAAACTGATAATAATACAAATGATTGTT 

GTCCATGGAAAGGCAAATAAATTTTCTACAGTCAAAAAAAAAAAAAAA 
FIGURE 242

MRSCLWRCRHLSQGVQWSLLLAVLVFFLFALPSFIKEPQTKPSRHQRTENIKERSLQSLAKP 
KSQAPTRARRTTIYAEPAPENNALNTQTQPKAHTTGDRGKEANQAPPEEQDKVPHTAQRAAW 
KSPEKEKTMVNTLSPRGQDAGMASGRTEAQSWKSQDTKTTQGNGGQTRKLTASRTVSEKHQG 

KAATTAKTLI PKSQHRMLAPTGAVSTRTRQKGVTTAVI PPKEKKPQATPPPAPFQSPTTQRN 
QRLKAANFKSEPRVTOFEEKYSFEIGGLQTTCPDSVKIKASKSLWLQKLFLPNLTLFIJDSRHF 
NQSEWDRLEHFAPPFGFMELNYSLVQKVVTRFPPVPQQQLLLASLPAGSLRCITCAVVGNGG 
ILNNSHMGQEIDSHDYVFRLSGALIKGYEQDVGTRTSFYGFTAFSLTQSLLILGNRGFKNVP 
LGKDVRYLHFLEGTRDYEWLEALLMNffTVMSKNLFW^ 

RYMKNRFLRSKTLDGAHWRIYRPTTGAI^IATALQLCDQVSAYGFITEGHERFSDHYYDTSW 
KRLIFYINHDFKLEREWKRLHDEGI IRLYQRPGPGTAKAKN 




FIGURE 243 

CGATGCGCGGACCCGGGCACCCCCTCCTCCrc 

GAGCAGCGAGTGGAAATTGTTCCTCGAGATCTGAGGATGAAGGACAAGTTTCTAAAACACCT 

TACAGGCCCTCTTTATTTTAGTCCAAAGTGCAGCA 
CCAGAGACTG<^CCATTCCTOCATACTATAAAAGATGC^^ 

GTCAGTCCAGTGTGCATGGAGGATAAGIGAGCAGACCGTACAGGAGCAGCACACCAGGAGCC 

ATGAGAAGTGCCTTGGAAACCAAC^GGGAAACAGAACTATCTTTATACACATCCC 

ACAAGAGATTTATTTTTGCAGACAGACTCTTCCAT^ 

ACAGTTTGC^GATATATATTCGATAAATCAGTGTACTTGACAGTGTTATCTGTCACITA 
FIGURE 244

MRGPGHPLLLGLLLVLGPSPEQRVEIVPraLRMKDKPLKHLTGPLYFSPKCSKHPHRLYHNT 
RDCTI PAYYKRCARLLTRLAVSPVCMEDK 
FIGURE 245

GGGCTGGGCCC05CCG<^GCrrCCAGCTGGCCGGCTTGGTCCTGCGGTCCCTTCTCTGGGAGG 

CCCGACCCCGGCOXXSCCCAGCCCCCACC^^ 
CTCACCGCAATGGCTCTGTTGGTGCIX3GGGGCTCCC 

GT<^TACCTGGACOGGAATGGCTCCTX3GCATCCGGGGTTTAACTGCGAGTTCTTCACCTTCT 

GCTGCGGGACCTGCTACCATCGGTACTGCTGCAGGGACCTGACCTTG^ 

CAGCAGAAGCACTGCCTGGCCTTCAGCCC^AAGACCATAGCAGGCATC^ 

CCTCTTTGTTGCTGTGGTTGCCACCACCATCTGCTGCTTCCTC^ 

ACCGCCGGCX3CCAGCAGCTCCAGAGCCCATTTGAAGGCCAGGAGATTCCAATGACAGG 

CCAGTGCAGCCAGTATACCCATACCCCC^GGACCCCAAAGCTGGCCCTGCACCCCCACAGCC 

TGGCTTCATGTACCCACCTAGTGGTCCTGCTCCCCy^TATCCACTCrACCCAGCTGGGCCCC 

CAGTCTACAACCCTGCAGCT^CTCCTCCCTATATGCCACCIACAGCC^TCTTACCCGGGAGCC 

TGAGGAACCAGCCATGTCTCrrGCISrc^ 

CCTGTACCTGCATCTGGTCCTGGGGGTGGCAGGAGTCCTCCAGCCACCAGGCGCCAGACCAA 

GCCAAGCCCTGGGCCCTACTGGGGACAGAGCCCCAGGGAAGTGGAACAGGAGCTGAACTAGA 

ACTATGAGGGGTTGGGGGGAGGGCTTGGAATTATGGGCTATTTTTACTGGGGGC 

GAGATGACAGCCTGGGTCACAGTGCCTGTTTTCAAATAGTCCCTCTGCTCCCAAGATCCCAG 

CCAGGAAGGCTGGGGCCCTACTGTTTGTCCCCTCTGGGCTGGGGTGGGGGGAGGGAGGAGGT 

TCCGTCAGCAGCTGGCAGTAGCCCTCCTCTCTGGCTC 

CTAGATTAAAGCTGTAAAGACAAAA 




FIGURE 246

MPPAGLRRAAPLTAlALLVLGAPLVIAGEDCLWyLDRNGSWHPGFNCEFFTFCCGTCYHRYC 
O^LTLLITERQQKHCLAFSPKTIAGIASAVILFVAVVATTICCFLCSCCYLYRRRQQIiQSP 
FEGQEIPMTGIPVQPVYPYPQDPKAGPAPPQPGFMYPPSGPAPQYPLYPAGPPVYNPAAPPP 

YMPPQPSYPGA 
FIGURE 247

GGGGGAGCTAGGCCQGCGGCAGTGGTGGTG0CGGCGGCGCAAGGGT6AGGGCGGCCCCAGAA 
CCCCAGGTAGGTAGAGCAAGAAGAT^TGTTTCTGCGCCTCAAATGGTCCOTTGC^CaVTG 
TCATTTCTACTTTCCTC^CTGTTGGCTCTCTTAACTGTGTCCACTCCTTCATGGTGTCAGAG 
CACTGAAGCATCTCCAAAACGTAGTGATGGGACACCATTTCCTTGGAATAAAATACQA.CTTC 
CTGAGTACGTCATCCCAGTTCATTATGATCTCTTGATCCATGCAAACCTTACCACGCTGACC 
TTCTGGGGAACCACGAAAGTAGAAATCACAGCCAGTCAGCCCACCAGCACCATCATCCTGCA 
TAGTCACCACCTGCAGATATCTAGGGCCACCCTCAGGAAGGGAGCTGGAGAGAGGCTATCGG 
AAGAACCCCTGCAGGTCCTGGAACACCCCCCTCAGGAGCAAATTGCACTGCrGGCTCCCGAG 
CCCCTCCTTGTCGGGCTCCCGTACACAGTTGTCATTCACTATGCTGGCAATCTTTCGGAGAC 

tttccacggattttacaaaagcacctacagaaccaaggaaggggaactgaggatactagcat 
caacacaatttgaacccactgcagctagaatggcctttccctgctttgatgaacctgccttc 

aaag caagtttctcaatcaaaattagaagagagcc aaggcacctagc catctc caatatgcc 

attggtgaaatctgtgactgttgctgaaggactcatagaagaccattttgatgtcactgtga 

agatgagcaccratctggtggccitcatcatttcagattttgagtctgtcagcaagataacc 

aagagtggagtcaaggtttctgtttatgctgtgccagacaagataaatcaagcagattatgc 

actggatgctgcggtgactcttctagaattttatgaggattatttcagcataccgtatcccc 

tacccaaagaagatcttgctgctattcccgactttcagtctggtgctat^ 

ctgacaacatatagagaatctgcrctgttgtttgatgcagaaaagtcttctgcatcaagtaa 

gcitggcatcac^gtgactgtggcccatgaactggcccaccagtggtttgggaacctggtca 

ctatggaatggtggaatgatctttggctaaatgaaggatttgccaaatttatggagtttgtg 

tctgtcagtgtgacccatcctgaactgaaagttggagattatttctttggcaaatgttttga 

cgcaatggaggtagatgctttaaattcctcacaccctgtgtctacacctgtggaaaatcx:tg 

ctcagatccgggagatgtttgatgatgtttcttatgataagggagcttgtattctgaatatg 

CTAAGGGAGTATCTTAGCGCTGAC<3C^TTTAAAAGTGGTATTGTACAGTATCTCCAGAAGCA 
TAGCTATAAAAATACAAAAAACXaAGGACCTOTGGGATAGTATGGCAAGTATTTGCXICTACAG 
ATGGTGTAAAAGGGATGGATGGCTTTTGCTCTAGAAGTCAACATTCATCTTCATCCTC!ACAT 

tggcatcaggaaggggtggatgtgaaaaccatgatgaacacttggacactgcagaggggttt 
tcccctaataaccatcacagtgagggggaggaatgtacacatgaagcaagagcactacatga 

AGGGCTCTGAGGGCGCCCCGGACACTGGGTACCTGTGGCATGTTCCATTGACATTCATCACC 
AGCAAATCCAACATGGTCC^TCGATTTTTGCTAAAAACAAAAACAGATGTGCTCATCCTCCC 
AGAAGAGGTGGAATGGATCAAATTTAATGTGGGCATGAATGGCTATTACATTGTGCATTACG 



aatgatcgggcaagtctcattaacaatgc^tttcagctcgtcagcattgggaagctgtccat 
tgaaaaggccttggatttatccctgtacttgaaacatgaaactgaaattatgcccgtgtttc 
aaggtttgaatgagctgattcctatgtataagttaatggagaaaagagatatgaatgaagtg 
gaaactcaattc^ggccttcctcatcaggctgctaa<x;gacctcattgataagcagacatg 
gacagacgagggctcagtotcagagcaaatgctgcggagtgaactactactcctcgcctgtg 
• tgcacaactatcagccgtg<x;tacagagggcagaaggctattt<^gaaagtggaaggaatcc 
aatggaaacttgagcctgcctgtcgacgtgaccttggcagtgtttgctgtgggggcccagag 
cacagaaggctgggattttctttatagtaaatatcagttttctttgtccagtackxagaaaa 
gccaaattgaatttgcccrrctgcagaacccaaaataaggaaaagcttcaatggctactagat 
gaaagctttaagggagataaaataaaaactc^ggagtttccacaaattcttacactcattgg 
caggaacccagtaggatacccactggcctggcaatttctgaggaaaaactggaacaaacttg 
tacaaaagtttgaacttggctcatcttccata<3cccacatggtaatgggtacaacaaatcaa 
ttcrccac^^gaacacggcrtgaagaggtaaaaggattcttcagctctttgaaagaaaatgg 
ttctcagctccgttgtgtccaacagacaattgaaaccattgaag^aaacatcggttggatgg 
ataagaattttgataaaatcagagtgtggctgcaaagtgaaaagcttgaacgtatgtaaaaa 
ttcctccgttgcccggttcctgttatctctaatcaccaacattttgttgagtgtattttcaa 
actagagatggctgttttggctccaacpggagatacttttttcccttcaactcattttttga 
ctatccctgtgaaaagaatagctgttagtttttcatgaatgggctttttcatgaatgggcta 
tcgctaccatgtgttttgttcatc^caggrrgttgccctgcaacgtaaac^caa 
tccctgckzacagaagaataaagtaccttattcttctcaaaaaaaaaaaaaaaaaaa^ 
FIGURE 248

MVFLPLKWSIATMSFLLSSLLALLTVSTPSWCQSTEASPKRSIXSTPFPWNKIRLPByVIPVH 

YDLLIHANLTTLTFWGTTKVE ITASQPTST 1 1 LHSHHLQI SRATLRKGAGERLSEEPLQVLE 
HPPQEQIALLAPEPLLVGLPYTWIHYAGNLSETFHGFYKSTYRTKEGELRILASTQFEPTA 

ARMAFP CFDE PAFKAS FS I K I RRE PRHLAI SNMPL VKS VTVAEGL I EDHFDVTVKMSTYLVA 
FIISDFESVSKITKSGVKVSVYAVPDKINQADYALDAAVTLLEFYEDYFSIPYPLPKQDIiAA 
I PDFQSGAMENWGLTTYRESALLFDAEKSS ASSKLG ITVTVAHELAHQWFGNLVTMEWWNDL 
WLNEGFAKFMEFVSVSVTHPELKVGDYFFGKCFDAMEVDALNSSHPVSTPVENPAQIREMFD 
DVS YDKGACI IiNMLRE YLS ADAFKSGI VQYWKHS YKNTKNEDLWDSMAS I CPTDGVKGMDG 
FCSRSQHSSSSSHWHQEGVbVKTMMNTWTLQRGFPLITITVRGRNVHMkQEHYMKGSDGAPD 
TGYLWHVPLTFITSKSNMVHRFLLKTKTDVLILPEEVEWIKFNVGMNGYYIVHYEDDGWDSL 
TGLLKGTHTAVSSNDRASL INNAFQLVS IGKLS I EKALDLSL YLKHETE I MPVFQGLNEL I P 
MYKLMEKRDMNEVETQFKAFLIRLIJiDLID^^ 

QRAEGYFRKWKESNGNLSLPVDVTLAVFAVGAQSTEGWDFLYSKYQFSLSSTEKSQIEFALC 
RTQNKEKLQWLLDESFKGDKIKTQEFPQILTLIGRNPVGYPLAWQFLRKNW^nOjVQKFEIjGS 
SSIAHMVMGTTNQFSTRTRLEEVKGFFSSLKENGSQLRCVQQTIETIEENIGWMDKNFDKIR 

VWLiQSEKLERM 
FIGURE 249

r&flrrRrAGAgG<^TCATGAGCGCGGTATTACTGCTGGCCC!TCCTGGGGTTCATCCTCCCA^ 

TGCCAGGAGTGCAGGCGCTGCTCTGCC^GTTTGGGACAGTTCAGCATC 

GACCTACCCCGGCAATGGACCCCTAAGAACACCAGCTGCGACAGCGGCrTGGGGTGCCAGGA 

CACGTTGATGCTCATTGAGAGCGQACCCCAAGTGAGCCTGGTGCTCTCCAAGGGCTGCACGG 

AGGCCAAGGACCAGGAGCCCCGCGTCACTGAGCACCGGATGGGCCCCGGCCTCTCCCTGATC 

TCCTACACCTTCGTGTGCCGCCAGGAGGACTTCTGCAACAACCTCGTTAACTCCCTCCCGCT 

TTGGGCCCCACAGCCCCCAGCAGACCCAGGATCCTTGAGGTGCCXlAGTCTGCrTGTCT^ 

AAGGCTGTCraGAGGGGACAACAGAAGA<^TCTGCCCCAAGGGG^ 

GGCCTCCTCAGGCTCAGGGGAGGAGGCATCTTCTCCAATCTGAGAGTCCAGGGATGCATGCC 

CCAGCCy^GTTGCAACCTGCTCAATGGGACACAGGAAATTGGGCCCGTGGGTATGACTGAGA 

ACTGCAATAGGAAAGATTTTCTGACCTGTCATGGGGGGACCACCATTATGA 

TTGGCTCAAGAACCCACTGATTGGACCACATCGAATACCGAGATGTC^ 

GTGTCAGGAGACGCTGCTGCTCATAGATGTAGGACTCACATC^CCCrrGGTGGGGACAAAAG 

GCTGC^GCACTGTTGGGGCTCAAAATTCCCAGAAGACCACCATCCACTC^CCCCTCCTGGG 

GTGCTTGTGGCCTCCTATACCCACTTOTGCTCCTCGGACCTGTGCAATAGTGCCAGCAGCAG 

CAGCGTTCTGCTGAACTCCCTCCCTCCTCAAGCTGCCC^ 

CTACCTGTGTGGAGCCCCTTGGAACCTGTTCAAGTGGCTCC^ 

GGCX3CCACTCATTGTTATGATGGGTACATTCATCTCTCAGGAGGTG<JGCTGTCCACCAAAAT 
GAGCATTCAGGGCTG Q3TGGCCCAACCTTCCASCTTCTTCTTC 

GGATCTTCTCTGOTCGTGAGAAGCGTGATGTGCAGCCTCCTGCCTCTCAGCATGAGGGAGGT 

GGGGCTGAGGGCCTGGAGTCTCTCACTTGGGGGGTGGGGCTGGCAC^ 

GTGGGGAGTGGTTTGCCCTTCCrGC2^CTCTATTACC^^ 

CCACCCACACTCAACCTCCCTCTGACCTCATAACCTAATGGCCTTGGACACCAGATTCTTTC 
CCATTCTGTCGATGAATCATCTTCCCCACACACAATGATTCATATCTACT 
ACACTGGGGAGAGCCTGGAGCATCCGGACTTGCCCTATGGGAGAGGGGACX3CrreGAGGAGTG 
GCTGCATGTATCTGATAATACAGACCCTGTCCTTTCA 
FIGURE 250



MSAVLLLMjLGFILPLPGVQALLCQFGWQHVWKVSDLPRQWTPKNTSCDSGLGCQDTLMLI 
ESGPQVSLVLSKGCTEAKDQEPRVTEHRMGPGLSLISYTFVCRQEDFCNNLVNSLPLWAPQP 
PADPGSLRCPVCLSMEGCLEGTTEEICPKGTTHCYDGLLRLRGGGIFSNLRVQGCMPQPGCN 
LLNGTQEIGPVG^ENC^KDFLTCHRGTTIMTHGNLAQEPTDWTTSirrEMCEVGQVCQETL 
IilDVGLTSTLVGTKGCSTVGAQNSQKTTIHSAPPGVLVASYTHFCSSDLCNSASSSSVLLN 
SLPPQAAPVPGDRQCPTCVQPLGTCSSGSPRMTCPRGATHCYDGYIHLSGGGLSTKMSIQGC 
VRQPSSFLLNHTRQIGIFSAREKRIJVQPPASQHEGGGAEGLESLTWGVGLAIA 
FIGURE 251

GCGAOGGGCAGGACGCCCCGTTCGCCT^ 

CAGGATGAGGGGGAATCTGGCCCTGGTGGGCGTTCT 

CCATCTGGACATCCTCAGCCX^CrrGGCGATGACGCCrKjCTCT 

CCTCAAAGGGGATGCGGGAGAGAAGGGAGACAAAGGOSCC^^ 

GCCCCACGGGAGAAAAAGGAGACATGGGGGAOVAAGGACAGAAAGGCAGTC 

GGAAAAATTGGTCCCATTGGCTCTAAAGGTGAGAAAGGAGATTCCGGTGACATAG 

TGGTCCTAATGGAGAACCAGGCCTCCCATGTGAGTGC 

AGATGGACAACCAGGTCTCTCAGCTGACCAGCC^ 

GGTGTGCGCGAGACGGAGAGCAAGATCTACCTGCTGGTGAAGGAGGAGAAGCGCTA 

CGCCCAGCTGTCOTGCCAGGGCCGCGGGGGGACGCTGAGCATGCCCAAGGAC 

ATGGCCTGATGGCCGCATACCTGGCGCAAGCCGGCCTGGCCCGTGTCTTCATCGGCATCAAC 

GACCTGGAGAAGGAGGGCGCCTTCGTGTACTCTGACCACTCCCCCATGTO 

GTGGCGC7VGCGGTGAGCCCZAACAATGCCTACGACGAGGAGGACTGCGTGGAGATGGTGGCCT 

CGGGCGGCTGGAACGACGTGGCCTGCCACACCACCATGTACTTCATGTGTGAG 

GAGJ^ACATGT^GCCTCAGGCTGGGGCTGCCCATTGGG 

GGCAGGGACAGAGCCCAGACCATGGTGCCAGCCAGGGAGCTGTCCCTCTC 

GCTCACTGAGTAGAGGGCTGTTGTCTAAACTGAGAAAATGGCCTATGCTTAAGAGGAAAATG 

AAAGTGTTCCTGGGGTGCTGTCTCTGAAGAAGCAGAGTTTCATTACCTGTATTGTAGCCCCA 

ATGTCATTATGTAATTATTACCCAGAATTGCTCTTCCATAAAGCTTGTGCCTTTC 

TATACAATAAAATCTTTAAGTAGTGCAGTAGTTAAGTCCAAAAAAAAAAAAAAAAAAA 
FIGURE 252

MRGNLALVGVLISIAFLSLLPSGHPQPAGDDACSVQILVPGLKGDAGEKGDKGAPGRPGRVG 
PTGEKGDMGDKGQKGSVGRHGKIGPIGSKGEKGDSGDIGPPGPNGEPGLPCECSQLRKAIGE 
MDNQVSQLTSELKFIKNAVAGVRETESKIYLLVKEEKRYADAQLSCQGRGGTLSMPKDEAAN 
GLMAAYIAQAGLARVFIGINDLEKEGAFVYSDHSPMRTFNKV^GEPNNAYDEEDCVEKV7^S 

GGWNDVACHTTMYFMCEFDKENM 
FIGURE 253

AGTGACTGCAGCCTTCCTAGATCCCCTCCACTCXMTTTCTCTCTTTGCA<3GAfiCACOGGCAfi 

CACCAGTGTGTGAGGGGAGCAGGCAGCGGTCCTAG^ 

CAGCCCCCGGCACAGAGCTGCTCCAC^GGCACC^^ 

CCTGGCCTTCAGCCTAGCTCAGAGCTTTGGGGCTGTCrGTAAGGAGCCACA 

TTCC1X3GCGGGGGCCGCAGCAAGAGGGATCCAGATCTCTA 

AAAAGCCACTCATCTCTGGAGGGATTGCTCAAAGCCCTGAGCCIAQGCT 

GGAATCAACATCTCCCGAGAAACGTGACATGCATGACTTCTT1X3TGGGACTTATGGGCAAGA 

GGAGCGTCCAGCCAGAGGGAAAGACAGGACCTTTCTTACCTTCAGTGAGGGTTCCTCGGCCC 

CTTCATCCCAAT<^GCTTGGATCCACAGGAAAGTCTTCCCTGGGAACAGAGGAGCAGAGACC 

TTT ATAAG ACTCTCCTACGGATGTGAATCAAGAGAACGTCCCCAGc-riTGGCATCCTCAAGT 

ATCCCCCCAGAGCAGAATAGGTACTCCACTTCCGGACTCCTGGACTGCATTAGGAAGACCTC 

TTTCCCTGTCCCAATCCCCAGGTGOGCAOGCTC 
AACATTCTTGTGCTTTGACTCCTTCTCCATCTTTTCTACC^ 

TAGTGAATATCCCCAACCCCAATGGGCATTGACTGTAGAATACCCTAGAGTTCCTGTAGTGT 
CCTACATTAAAAATATAATGTCTCTCTCTATTCCTCAACAATAAAGGATTTTTGCATATGAA 
AAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAA 
FIGURE 254

MRIMLLFTAIIAFSLAQSFGAVCKEPQEEVVPGGGRSKRDPDLYQLLQRLFKSHSSLEGLLK 
ALSQASTDPKESTSPEKRDMHDFFVGLMGKRSVQPEGKTGPFLPSVRVPRPLHPNQLGSTGK 

SSLGTEEQRPL 
FIGURE 255

GGGCGTCTCCGGCTGCTCCTATTGAGCTGTCTGCTCGCTGTGCCCGCTO 

CGCGCTGTCX3CCGCTGCTACCGCGTCTGCTGGACGC<3GG^ 

GAGCCCTGCGGAGAGCTCAAGCGCCCAGCTCTGC 

CCATAGTTGCTGCAGGAGTGGAGCCATCAGCTCCGTCCT 

TGCTGTTCCTGGTCTGCGGATCCCAAGGCTACCTCCTGCCCAACGTCACT 

CTGCTCAGCAAATACCAGCACAACGAGTCTC^ 

GGACAAGK5AGGAGATCCTCATGCIGCACAACAAGCT 

CCAACATGGAGTACATGGTGAGC<3CCGGCTCCGGCCGCAGAGGCTGGC^ 

CTGGGCCACCAGCCTGCTCTGTTCCCCAGCCAGCT^ 

CTGGCTCAGGGTCTCCTCTGGCAGGK?GAGGA^ 

TTGAGACAGW3GTCTCACTCTGCCACTGACGCT 

AAACCTT^ACTCCCGGGGTTAAGOSATCCTGCTTCAGCCTCCCAAGTAGCTGG 

GCATGCACCATGGTGCCCAGCTAGATTTTAAATATTTTGTGGAGA 

TGCCCAGGCTGGTCTTGAACTCCTAGGCT 

CTAGGATTATAGGCATGAGTCACCGTGTCTG 

ACAACACACGTGGGTTCCCTGTGCAGAGCCTGCCTGGTTC 

TCCACTGGGAACACAGCTCTCAGGCTTTCGGA 

CTGGGCTTTGCTGATGCTGATCTCAGCTGTGCCA 

CTTAGCCCGTGTGAGCCTCACTTTCCACTTG^ 

GTGAGATAAGTCGAGGCTGTGAAGGGCCCXSKSCACA^ 

CTTTGCTAACCGGGAAAGGAGCTAACGGTGACAGAAGAC^^ 

GATTGTGATGGGTGTTGCAGGTGTGGTTGGGCGATGGTGCT^ 

TGGAAACTTCCTTCCTGGCTGGTTTTCCAG 

GTCCCTCCTCGTCCACCAAGCGGGAGCCTCCAC^ 

TTTAAACAAACCCACGTCCCAGCCTGGGTAAGATGGTAAAGCGCGGTC 

CAAGTTAGCCGGGCATGGTGGTGCGCAGCTGT^ 

GTGGAGGTGGGGGGTGGGAG CTGAGGAAGGAGGATCGCTTGAGCCTGGGAAGTCGAGGCTGC 
AGTGAGCTGAGATTGCACGAGTGCACTGGAGCCTO 
FIGURE 256

MSCVLGGVIPLGLLFLVCGSQGYLLPNVTLLEELLSKYQHNESHSRVRRAIPREDKEEILML
HNKLRGQVQPQASNMEYMVSAGSGRRGWHRGWGLGHQPALFPSQLCSPASACDGWLRVSSGR 

GGSRLCSVLFVCFETGSHSATDAGVQWHNRHALKP 
FIGURE 257

AA<MAGAGGCCACCGGGACTTCAGTGTCTCCTCCATCCCAGGAGCGCAGTGGCCACTSEJ3^ 
GTCTGGGCTGCCCCITGTCCTCCTCrTGACCCTCCTTG 

GTAT<^CTTTGC^CTGAAGCTGAAGGAGTCTTTTCTGACAAATTCCTCCTATGAGTCCAGC 
TTCCTGGAATTGCTTGAAAAGCTCTGCCTCCTCCT<^ATCTC 

CCTCCACCATGCAAGATCTCAACACCATGTTGTCTGQIACACASGACAGCCATTGAAGCCTG 

TGTCCTTCTTGGCCCGGGCTTTTGGGCCGGGGATGCAGG^ 

CAGCAGGCCCCCACCCTCCTGAGTGGCAATAAATAAAATTCGGTATGCTG 



FIGURE 258

MGS6LPLVLLLTLLGSSHGTGPGMTLQLKLKESPLTNSSYESSFLELLEKLCLLLHLPSGTS 
VTLHHARSQHHWCNT 




FIGURE 259 

AATTGTATCTGTGTAATGTTAAAACAAACGAAATAA^ 

CAAAAACAACAGACTAGTACTCTAAAGAACTCT^ 

TATGATTGGATATTATTTAATTCTGTTTCTGATGTGGGGTTCOT 

TATTAATATTTACCATTGCAGAAGCTTCATTCAGTGTTGAAAAT 

TGCCTCTTACGCATATGTTACAAATTATCT 

CTCCGATTGTTCTAA ATAA TTGAAAGATGTCTGCT 

TATGATTCTCAACCATCTTTAGTTGGGA 

TTTTCITGGCACTAATCAAGTGAGTGTTACCTTTTCA 

GTAAAATAGAAACCTGTGTTTATTCTCAGGTATTTTAGAAACAAC^ 

ATGTGTGTGTTCTTGGCTGTATTCATAAATTATATATTTTGGGCTATCAAATATTACT 

TCAATATAAATAACAATAGTAGAAGTTGTTTACTTAGATATGCITTCTA<3 

AGCCTATGTAAGACTACTTTGTTGTAATAGCCTTTGAAATTTACAG 

TCTTCAGATTACTTGATTCAAATAAACCAATTA 

ATAAAAGTTCATATCTACCC 



FIGURE 260

MIGYYLILFLNWGSSTVFCVLLIFTIAEASFSVENECLVDLCLLRICTKLSGVPNQCRVPLP 
SDCSK 
FIGURE 261

GAGGATTTGCCACAGCAGCGGATAG 

G AG AAGAGCCACAGCATT^GAGACTGCCCTGCTTGGTGTTTTG CAGGATGATGGTGGCCCTT 

CGAGGAGCTTCTGCATTGCTGGTTCTGTTCCTTGC^ 

CCAGGACCCAGCCATGGTGCATTACATCTACC^ 

AAAAATGTACCCAAGCAACGAGGGCATACATTC^^GAATTCCAAGAGTTCT 
TCTGTCATGCTGGGAAGATGTCAGACCTACACAAGTC 
GteCACTGAGAGTTGAACGTGCCCAACGGGAGATTGACT 
ACGAGTGCATCGTATCAGAGGACAAGACACTGGCA 

GAGAAAAAGATCCGGACTCTGCTGAATGCAAGCTGTGACAACATGCTGATG^ 

TTTGAAAATAGTGAAGAAGATGATGGACACACATGGCTCTTGGATGAAAGATGCT 

ACTCTCCAAAGGTGTACTTATTAATTGGATCCAGA 

ATACGGGCATTCATGGAGGATAACACCAAGCCAGCT 

CTX^CAGGGAACAGGCCAAGTGATCTACAAAGGTTTTCTATT 

CTAATGAGATAATCAAATATAACCTGCAGAAGAGGACTC 

GGAGGGGTAGGCCGAGCATTGGTTTACCAGCACTCCCCCTCAACra 

GGATGAGCATGGGCTCTGGGCCATCCACTCTGGGCCAGGCACCCATAGCCATTTGG 

CAAAGATTGAGCCGGGCAGACTGGGAGTGGAGC^ 

GATGCTGAAGCCTCATTCCTCTTGTGTGGGGTT 

GGGCCCTCATCGCATCACCTGGATCTATGATCCACTGGGCACTATCAGTC 
CCAACTTGTTCTTCCCCAAGAGACCAAGAAGTCACTCCATGATCCATTACAACCCCA 
AAGCAGCTCTATGCCTGGAATGAAGGAAACCAGATCATTTACAAACTCCAGA 
GCTGCCTCTGAAGT^TGCATTAC^ 

TACAGGACAGTGAGGCTATAGCCCCTTCACAATATAGTATCCCTCTAATCACAC^ 

AGTGTGTAGAAGTGGAAATACGTATGCCTCCTTTCCGAAATGTCACTGCCTTAGGTA 

CAAGAGCTTAGATGAGAGCATATCATGAGG 

CCTCCTGGCTCTCAAGGATGACCACATTCTGATAC^^ 

GCTCCCCAGCATTTACTGTAACTCTGCCATCITCCCTCC 

CCCTAATATTCACCACTGGCTTTTCTCTCC 




AGAGCTCAAGAGATCCTCCTGCCTCAGCCTCCTAAGTACCTGG^ 
CACACCTGGCTTAAAATACTATTTCTTATTC 

CTTCCACTAAGCTTGGTAGATGTAATAATAAAGTGAAAATATTAACATTTGA 

CCAGGTGTGGAGTGTTTGCACATCATTGAATTCTCGTTTCACCTT^ 

TCTTTACAGCTGTCATTCTAGAGTTTAGGTGAGTAACACAATTACAAAGTGAAAGA 

TAGAAAATACTACAAATCCCATAGTTTTTCCATO 

TGTTCACCTACTCTTATAGTCAATGCGT^ 

TTTAGCCAGTTTTCATGTCTGCACAAGACCT^ 

AGAAAACCAGTCTAAGGGTGAGGACCCCAACTCTAGCCT 

TTCTCTCTTTCTGCTTTAAATTCAATA 
FIGURE 262

MMVALRGASALLVLFIiAAFLPPPQCTQDPAMVHYIYQRFRVLEQGLEKCTQATRAYIQEFQE 
F SKN I SVMLGRCQT YTS E YKS AVGNLALRVERAQRE ID Y I Q YLREADEC I VS EDKTLAEMLL 
QEAEEEKKI RTLLNAS CDNMLMGI KS LK I VKKMMDTHG S WMKDAVYNS PKVYLL I GSRNNTV 
WEFANI RAFMEDNTKPAPRKQ I LTLS WQGTGQV I YKGFLFFHNQATSNE 1 1 KYNLQKRTVED 
RMLLPGGVGRALVYQHSPSTYIDLAVDEHGLWAIHSGPGTHSHLVLTKIEPGTLGVEHSVTOT 
PCRSQDAEASFLLCGVLYWYSTGGQGPHRITCIYDPLGTISEEDLPNLFFPKRPRSHSMIH 

YNPRDKQLYAWNEGNQ I I YKLQTKRKLPLK 
FIGURE 263

GGGCGCCCGCGTACTCACTAGCTGAGGTGG 
TGTCGGAGCTCATGGGGCTGTCGGTGTTGCCT 

GTAGCGCGGGGGTGGCTGCGCGCGGGGGAGGAGAGGAGCGGCCGGCCCGCCTGCCAAAAAGC 

AAATGGATTTCCACCTGACS^TCTTCGGGATCCAAGAAGCAGAAA 

GGAAGGAGAAGCCTCAACAACACAACTTCACCCACCGCCT 

CACAGCGGGAACATATCTTGCATGGACTTTAGC^^ 

AGATGATCGCACCATCCGCATCTGGAGCACCAAGGACTT 

TGAGAGCCAACGTGGAGCTGGACCACGCCACCCTGGTGCGCnTCAGCCCT 

TTC^TCGTCTGGCTGGCCAACGGGGACAGCCTCCGTGT 

TGGGGGCTACACCTTCACAGCCACCCC^^ 

TCGACATTGGCATTGCTAACACAGGGAAGTTTATCATGACTGCCT 

CTCATCTGGAGCCTGAAGGK3TCAAGTGCTC 

ACACGCTGCTGTATCTCCCTGTGGCAGATTC 

AGGTTTGGGAAGTCTGCTTTGGAAAGAAGGGGGAGTTCCAGGAGGTG^ 

CTAAAGGGCCACTCCGCGGCTGTGCACTCX5TTTGCTTTCT 

TTCTGTCTCCAAGGATGGTACATGGAAACTGTGGGACA 

AGGACCCCTACTTGCTGAAGACAGGCCGCTTTGAAGAGGCGGCGGGTC 

CTGGCCCTCTCCCCCAAaSCCCAGGT 

CAATACCCGGCGGGGCGAGAAGGAGGAGTGCTTTGAGCGGGTCCATGGCGAG 

ACTTGTCCTTTGACATCACTGGCCX3CTTTCTTC 

TTTCACAACACTCCTGGCC^CCGAGC 

CTCCAACGAGAGCACCCGCCAGAGGCTGCAGCAGCAGCTGACCCAGGCCCAAGAGACCCTGA 
AGAGCCTGGGTGCCCTGAAGAAGT^CTCTO 

GGATCTGGCCTCCTCATGGC7VCTGCTGCCATCTTTCCTCCCAGGTC 
AGTCTCCTGGTTTTCTTACTGGTGGCC^ 
AGGTCTCTCTCTTCTTGCTGGCTGTGACTCCT
CTCCCAGGCCCAGTGGGTGGAATCTGTCCCC^^ 
AGGAGAGAGAGAGAGAATGTGATTTTTGGCCTTGTGGCA^ 
TTTGTAAATGTTCCAGAACAACCTAGAGAACACCTC 
TGGGAGACTGGGATAGCTTCCCATCACAGAACT 

TCCTTCTGGGCCTCAGTTCTATTTGTAAGATGGAGAATAATCCTCTCTGTGAACT 
AAGATGATATGAGGCTAAGAGAATATCAAGTCCCCAGGTCTGGAAGAAAAGTAGAAAAGAGT 
AGTACTATTGTCCAATGTCATGAAAGTGGTAAAAGTGGGAACCAGTGTGCT^ 
TTAGAAACACATTCCTTGGGAAGGGAAAGTOT 

TGGGACTTCTCTCTTCGGGAGATGATATCTTGTTTAAGGAGACCTCTTTTC^ 
TTCATCAGATATTTGAGTGCCCACTCTGTC 

AAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAA 



FIGURE 264

MELSQMSELMGLSVLLGLIALMATAAVARGWLRA 

QYQRIRKEKPQQHNFTHRLLAAALKSHSGNISCMDFSSNGKYIATO^DRTIRIWSTKDFI^ 

REHRSMRANVELDHATLVRFSPDCRAFIVWIJ^GDTLRVFKMTKREDGGYTFTATPEDF 

HKAPVIDIGIANTGKFIMTASSDTTVLIWSLKGQVLSTINTNQMNNTHAAVSPCGRFVASCG 

FTPDVKWEVCFGKKGEFQEVVI^FELKGHSAAVHSFAFSNDSRRMASVSKDGTWKLWDTDV 

E YKKKQDPYLLKTGRFEEAAGAAPCRLALS PNAQVLALASGS S IHLYNTRRGEKEECFERVH 

GECIANLSFDITGRFLASCGDRAVRLFHNTPGHRAMVEEMOGHLKRASNESTRQRLQQQLTQ 



AQETLKS LGALKK 
FIGURE 265

TGGCCTCCCCAGCTTGCCAGGCACAAGGCTGAGCGGGAGGAAGC 

(^GTGTTTTGCCTTC^CCCCAAGra 

TCCTAGTAACTGTGTCTGACTCTCGCT 

GGGGCAGGCACCTGOTGTGCCATCAGCCTGTGGC 

GGGGCGGGAAGGCGAGGAGTGCCACCCCGGCAGCCACAAGGTCCCC^ 

AGCACCACACCTGTCCTTGCTTGCCCAACCTGCTGTGCTCCAC^ 

CGCTGCTCCATGGACITGAAGAAC^^ 

CCATCCTTTTCCTGAGCACAGCCTGGATTTTTATTTCT 

TCTCCCAGTCCCTACACTGACTACCCTGAT 

GCAGACATACCTCCCATCATGACATGGTCCCCAGGCTGGCCTGAGGATGT 
CTGTGGTGTGAAAGGTGGCCAGCCTGGTTCTCTTCCCTGCTCAGG^ 
AATGGCAGAAAGGACATTCCCCCrCCCCTCCCCAGGTGACCTGCT 
CCCCTCTCCCCACATGTATCCCTCGGTC^ 

GCATTGCTCAGAGTCCCAGGTCCTGGCCIX3ACCCTCAGGCCCOT 

ACCAATTTGTGGGTAGTTCATCTTCCCTCGATTGGTTAACTCCTTAGTTTCAGACCACAGA 

TCAAGATTGGCTCTTCCCAGAGGGCAGCAGACAGTC^ 

GGGAGGCCAATCAGCCCCCTGAAGACTCTGGTCCCAGTCAGCC^ 

CCTGTGACCTTCTGCCAGAATTGTCATGCCTCTC^ 

TAACC^CTGAAGCCCCCAATTCCGACAGCT 

TCTAATCTGATATTGACATATTAGAAGGCAATTAGGGTGTTTCCTT^ 
AGGATCAGCCCTGAGAGCAGGTTGGTGACTTTGAGGAGGGCAGTCCTCTG 
TGGGAGCAAGGGACAGGGAGCAGGGCAGGGGCTGAAAGGGGC^ 
CAACTACACACCAACATGCTGGCTTTA 
FIGURE 266

MRGATRVSIMLLLVTVSDCAVITGACERDVQCGAGTCCAISLWLRGLRMCTPLGREGEECHP
GSHKVPFFRKRKHHTCPCLPNLLCSRFPDGRYRCSMDLKNINF 
FIGURE 267

AGCGCCCGGGCGTCGGGGCGGTAAAAGGCCGGCAGAAG<^ 

CTCC^GGACCCAAGTTTCTTCACCATGGGGATGTGGTCC^^ 

TCCTGCCTTGGCATTGCTGCTTGCCAAC^ 

CCCTGGAGTACCTGGAGGATATAGACCTGAAAACACTGGAGAAGGAACCAAGGACTTTCAAA 
GCAAAGGAGCTATGGGAAAAAAATGGAGCTGTGATTATGGCCGT^ 

CCTCTGTCGAGAGGAAGCTGC^ATCTGTCCTCCCTGAAAAGCATGTTGGACCAGCTGGGCG 

TCCCCCTCTATGCAGTGGTAAAGGAGCACATCAGGACTGAAGTGAAGGATTTCCAGCCTTAT 

TTCAAAGGAGAAAT CrTCCTGGATGAAAAGAAAAAGTTCTATGGTCCACAAAGGCGGAAGAT 

GATGTTTATGGGATTTATCCGTCTGGGAGTGTGGTACAACTTCITCCGACCCTGGAACGGAG 

GCTTCTCTGGAAACCTGGAAGGAGAAGGCTTCATCCTTGGGGGAGTTTTCGTGGTGGGATCA 

GGAAAGCAGGGCATTCTTCTTGAGCACCGAGAAAAAGAATTTGGAGACAAAGTAAACCT ACT 

TTCTGTTCTGGAAGCTGCTAAGATGATCAAACCACAGACTTTGGCCTCAGAGAAAAA^ 

TGTGTGAAACTGCCCAGCT(^GGGATAACC^GGGACATTCACCT<3TGTTCATGGGATGTATT 

GTTTCCACTCGTGTCCCTAAGGAGTGAGAAACCCATTTATACTCTACTCTCAGTATGGATTA 

TTAATGTATTTTAATATTCTGTTTAGGCCC^CTAAGGCAAAATAGCCCCAAAACAAGACTGA 

CAAAAATCTGAAAAACTAATGAGGATTATTAAGCTAAAACCTGGGAAATAGGAGGCTTAAAA 

TTGACTGCCAGGCTGGGTGCAGTGGCTCACACCTGTAATC^ 

TGAGCAAGTCACTTGAGGTCGGGAGTTCGAGACC^^ 

TCTACTAAAAATACAAAAATCAC02GGGTGTGGTGG<^GGCAGCTGTAGTCCCAGCTACCCG 
GGAGGCTGAGGCAGGAGAATCACTTGAACCTGGGAGGTGGAGGTTGCGGTGAGCTGAGATCA 
CACCACTGTATTCCAGCCIXSGGTGACTGAGACTCTAACTAA 
FIGURE 268

MSFLQDPSFFTMCTWSIGAGALGAAALAlilA 

TFKAKELWEKNGAVIMAVRRPGCFLCREEAADLSSLKSMLDQLGVPLYAVVKEHIRTEVKDF 
QPYFKGEIFLDEKKKFYGPQRRKMMFMGFIRLGVVfnJFFRAV^GGFSGNLEGEGFILGGVFV 

VGSGKQGILLEHREKEFGDKVNLLSVLEAAKMIKPQTLASEKK 



FIGURE 269

ACGGACCGAGGGTTCGAGGGAGGGACACGGA^ 

GGCCAGGTGCCCCGTCGCAGGTGCCCCT^ 

AGCCCCTTCCTCGGCTCCTGCC^C^ 

CTTCTGGCGCTGGGCCTGCCGTTCCTGCTGGCCCGCTGGGGCCGA^ 

GACCACTTCTGCAAATGAGAATAGCACTGT 

GCAACCTGCGTCCGGAAGCCATCACTGCTAT 

CTCCTGGCTGTGGGGCTGGCACTGTTGGTGCGGAAGCITCGGG^ 

CACCTACCGGCCCAGTAGCGAGGAGCAGTTCTCCC^ 

ACTCCAAGGAGACGGTGCAGGGCTGCCTGCCC^^ 

CTTCATTGCTGTGTGACCTTGGGGAAAGGCAGTGCC 

TGCTTAATAGCAGGGAAGAAGGTACTTCAAAGACT^ 

GCTATTCACTTrTATATATTTATATAAAATTAGTAGTGAGATGTAAAAAAAAAA 
FIGURE 270

^^PGI^LIAiAIX3LPFLlJ^WGRAWGQIQTTSANENSTVLPSSTSSSSDGNLRPEAITAIIV 
VFSLIAALIiLAVGIjALLVRKLREKRQTEGTYRPSSEEQFSHAAEARAPQDSKETV 
FIGURE 271

AATATATCATCTATTTATCATTAAT 

TTGGGATTTTAATTTTCAAACACAGCAGAM 

GTATGTGAAGCTATTTGGAGATCCAATTCA^ 

AT CAAG AAATAAAGAGAACCACAGTCAACCGACAC^ 

CTACCAAAGCTGTCAAAACCACAGGCAAGGGC&^ 

GGGTTAATTCTTGGTGCTGAAGCCTGGGGCAGGGGTGTAAAGAAAAACACT 
ATTGTAAATTTAAGGCAAATACACATATTAGTA 

TATACAATAAGGTGAAATTATAAGTACCCTATGCAGTTGGCTGGACAGTTCTA^ 

TTATTAATTTTTAAAATCAGTAACTG^ 

TCATATAATTTGATACAAATAAAAGAAAAGTGTTCTCTCC^ 

ATGCGATACAGTTAGAATAGGAAATATGACATTA^^ 

AGAAGGGAAAATGTTGCCAAGGAAAAAAAAA 



FIGURE 272

MTFFLSIiLI^VCEAIWRSNSGSNTLENGYFLSRNKENHSQPTQSSLEDSVTPTKAVKTTGK 
GIVKGRNLDSRGLILGAEAWGRGVKKNT 
FIGURE 273

nm^mcx a at A ac^ AGAGAGGAAC AATGG GGTTATTCAGAGGTTTTGTTTTCCTCTTAGTTCT 
GTGCCTGCTGCACCAGTCAAATACTTCCTTCACT^ 

TTGTCATTGTTATAGATCCTAGTGTGCC^GAAGATGAAAAAATAATTGAACAAATAGAGGAT 

ATGGTGACTACAGCTTCTACGTACCTGTTTGAAGCCACAGA 

TGTATCTATATTAATTCCTGAGAATTGGAAGGAAAATCCT 

AAAACCATAAACATGCTGATGTTATAGTTGCACCACCTACACTCCCAGGTAGAGATGAACCA 

TACACCAAGCAGTTCACAGAATGTGGAGAGAAAGGCGAATACATTCACTTCACCCCTGACCT 

TCTACTTGGAAAAAAACAAAATGAATATGGACCACGAGGCAAACT 

CTCACCTCCGGTGGGGAGTGTTTGATGAGTACAATGAAGATCAGCCTTTCTACCGTG 

TCAAAAAAAATCGAAGCAAC^^GGTGTTCCGCAGGTATCT 

GTGTCAAGGAGGCAGCTGTCTTAGTAGAGCATGCAGAATTC^ 

GAAAAGATTGTCAATTCnTTCCTGATAAAGTACAAACAGAAAAAGCATCCATAATGTTTATG 

CAAAGTATTGATTCTGTTGTTGAATTTTGTAACGAAAAAACCCATAATCAAGAAGCTCCAAG 

CCTACAAAACATAAAGTGCAATTTTAGAAGTACATGGGAGGTGATTAGCAATTCT 

TTAAAAACACCATACCCATGGTGACACCACCTCCTCC^CCTGTCTTCTC^ 

AGTCAAAGAATTGTGTGCTTAGTTCITGATAAGTCT 

AAATCGAATGAATCAAGCAGCAAAACATTTCCTGCTC 

TGGGGATGGTTC^CTTTGATAGTACTGC(^CTATTGTAAATAAGCTAATCCAAATAAAAAGC 

AGTGATGAAAGAAACACACTCATGGCAGGATTACCTACATA 

CTGCTCTGGAATTAAATATGCATTTCAGGTGATTGGAGAGCTACATTCCCAACT 

CCGAAGTACTGCTGCTGACTGATGGGGAGGATAACACTGCAAGTTCTTGTAT^ 

AAACAAAGTGGGGCCATTGTTCATTTTATTGCTTTGGGAAGAGCTGCTGATGAAGCAG 

AGAGATGAGCAAGATAACAGGAGGAAGTCATTTTTATGTTTCAGATGAAGCTCAGAACAATG 

GCCTCATTGATGCTTTTGGGGCTCTTACATC^GGAAATACTGATCTCT 

CAGCTCGAAAGTAAGGGATTAACACTGAATAGTAATGCCTGGATGAACGACACT 

TGATAGTACAGTGGGAAAGGACACGTTCTTTCTCATCACATGGAAC^GTCTGCCTCCCAGTA 

TTTCTCTCTGGGATCCCAGTGGAACAATAATGGAAAATTTCACAGTGGATGCAACIT 

ATGGCCTATCTCAGTATTCCAGGAACTGCAAAGGTGGGCACTTGGGCATA 

C^AAGCGAACCCAGAAACATTAACTATTACAGTAACT 

CTCCAATCACAGTGAATGCTAAAATGAATAAGGACGTAAA^^ 

GTTTACGCAGAAATTCTACAAGGATATGTACCTGTTCTTGGAGCCAATGTGACTGOT 

-TGAATGAGAGAATGGAGAT-AGAGAAGT-T-T-TC^ 
CTTTCAAGAATGATGGAGTCTACTCCAGGTATTTTACAGCATATACAGAAAATGGK^ 
AGCTTAAAAGTTCGGGCTCATGGAGGAGCAAACACTGCCAGGCTAAAATTACGGCCTCCACT 
GAATAGAGCCGCGTA<^TACCAGGCTGGGTAGTGAACGGGGAAATTGAAGCAAACCCGCC^A 
GACCTGAAATTGATGAGGATACTCAGACCACCTTGGAGGATTTCAGCCGAACAGCATCCGGA 
GGTGCATTTGTGGTATCACAAGTCCCAAGCCTTC 
AATCACAGACCTTGATGCCACAGTTCATGAGG 

GAGATAATTTTGATGTTGGAAAAGTTCAACGTTATATCATAAGAATAAGTGCAAGTATTCTT 
GATCTAAGAGACAGTTTTGATGATGCTCTTCAAGTAAATACTACTGATCTGTCACCAAAGGA 
GGCCAACTCCAAGGAAAGCTTTGCATTTAAACCAGAAAATATCT 

AGATATTTATTGCCATTAAAAGTATAGATAAAAGCAATTTGACATCAAAAGTATCCAACATT 
GCACAAGTAACTTTGTTTATCCCTCAAGCAAATCCTGATGACATTGATCCTACACCT 
TACTCCTACTCCTACTCCTGATAAAAGTCA^ 
TGTCTGTGATTGGGTCTGTTGTAATTGTTAACTT^ 

ACGAAGAAAAAAATCTTCAAGTAGACCTAGAAGAGAGTTTTAAAAAACAAAACAATGTAAGT 

AAAGGATATTTCTGAATCTTAAAATTCATCCCATGTGTGATCATAAACTCATAAAAATAATT 

TTAAGATGTCGGAAAAGGATACTTTGATTAAATAAAAACACTCATGGATATGTAAAAAC^ 

CAAGATTAAAATTTAATAGTTTCATTTATTTGTTATTTTATTTGTAAGAAATAGTGATGAAC 

AAAGATCCTTTTTCATACTGATACCTGGTTGTATATTATTTC 

GATATTTCAAATTGCATCAAGAAATTAAAATCATCT^ 

GGAGAGCAAATAAACAACATTTGGAAAAAAAAAAAAAAAAAAAAAAAAAA 

AAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAA 
FIGURE 274

MGU'RGFWLLVLCLIiJQSNTSFIKlJ^GFEDIVIVIDPSVPEDEKIIEQIBDMVTTMTY 
LFEATEKRFFFKNVSILIPENWKENPQYKRPKHENHKHADVIVAPPTLPGRDEPYTKQFTEC 
GEKGEYIHFTPDLLLGKKQNEYGPPGKLFVHEWAHLRWGVFDEYNEDQPFYRAKSKKIEATR 

CSAGISGRNRVY KCQGGS CLSRACRIDSTTKLYGKDCQFFPDKVQTEKAS IMPMQS IDSWE 
FOJEKTHNQEAPSLQNIKCNFRSTWEVISNSEDFKNTIPMVTPPPPPVFSLLKISQRIVCLV 
LDKSGSMGGKDRLNRMNQAAKHFLLQTVENGSI^GMVHFDSTATIVNKLIQIKSSDERNTLM 
AGLPTYPLGGTSICSGIKYAFQVIGELHSQLDGSEVLLLTDGEDNTASSCIDEVKQSGAIVH 
FIALGRAADEAVIEMSKITGGSHFYVSDEAQNNGLIDAFGALTSGNTDLSQKSLQLESKGLT 
LNSNAWMNDTVI IDSTVGKDTFFLITWNSLPPS I SLWDPSGT IMENFTVDATSKMAYLS I PG 
TAKVGTWAYNLQAKANPETLTITVTSRAANSSVPPITVNAKMNKDWSFPSPMIVYAEIL^ 
YVPVLGANVTAFIESQNGHTEVLELLDNGAGADSFKNDGVYSRYFTAYTENGRYSLKVRAHG
GANTARLKLRPPLNRAAYI PGWWNGEI EANPPRPE IDEDTQTTLEDFSRTASGGAFWSQV 
PSLPLPDQYPPSQITDLDATVHEDKI ILTWTAPGDNFDVGKVQRYI IRISASILDLRDSFDD 
ALQVNTTDLSPKEANSKESFAFKPENISEENATHIF IAIKSIDKSNLTSKVSNIAQVTLFIP 
QANPDDIDPTPTPTPTPTPDKSHNSGVNISTLVLSVIGSWIVNFILSTTI 
FIGURE 275



CTCCTTAGGTGGAAACCCTGGGAGTAGAGTACT 

GCAGGGGTGACAACAGGTGTCATCTTTTTGATCTCGTGTGTGGCT 

GCCAAGGTAATTTTGACCCAGAGGAGOATGATGTAGCCACCTCCT 

AGTTATGCCAGGATTTACTAGAGAGTGTCAACTCAACCAGCAAGCGGCTCCTO 

TTGGAGGAGAGAACCTTTGTGGGGCTGCGTTCTCTTAGCAGTC 

GACCAGAAGAAAGGAAAGGTCCCCTCTTGCTGTTGGCTGCACATCAGG^ 

GGTGAAAACTTGGAGATTTCACXTCAGTCATTGCTTCT 

AGCTGCTCTGTGTGGTGGTTAACTCCAAGAGGCAGAACTCGTTCTAGAAGGAAATGGATGC^ 
TCCGGGGGCCCCAAACGCATGCTTCCTGTGGTCTAGCCCAGGGAAGCCCTTCCGTGG 

TTGAGGGATGCCACCGGTTCTGGACGCATGGCTGATTC 

TGCGTGGATTTCCCGGGTGGTGGTTTTGCTGGTGCTCCTCTO 

TGGCCTGCACCCCAAAAGteTGACGAGGAGCAGCTGGGACT 

GAGGGGTACCAGGCCGTCCTTCAGGAGTGGGAGGAGCAGCACCGCAACTACGTGAG 

GCAGATCGCACAGCTCAAGGAGGAGCTGCAGGAGAGGAGTC 

CCAGCGATGCTGCTGGCCTGGGTCTGGACAG^ 

TTCCTGCACTCGCAGGTGGACAAGGCAGAGGTGAATGCTG 

AGTGCCTTTCGATAGCTTTACTCTACAGAAGGTGTACCAGCTGGAGACTGGCCT 

AGGAGAAGCCTGTGAGGAAGGAC^GCGGGATGAGTTGGTGGAA^ 

CTGAACAATCCTGCAGAGAACAGCCCCAATCACCGT C CTTACACGGCCTCTGATTTCATAGAAGGGAT 

CTACCGAACAGAAAGGGACAAAGGGACATTGTATGAGCTCACCTTCAAAG 

TCAAACGGCTCATCTTATTTCGACCATTCAGCCCCATCATGAAAGTGAAAAATGAAAAGCT 

GCCAACACGCTTATCAATGTTATCGTGCCTCTAGO^AAAGGGTGGACAA 

GAATTTCAGGGAGATGTGCATTGAGCAGGATGGGAGAGTCCATCTCACTC 

AAGAAATAAATGAAGTCAAAGGAATACTTGAAAACACTTCCAAAGCTGCCAACTT 

TTCATCCAGCTGAATGGAGAATTTTCTCGGGGAAAGGGACT^ 

AAGCAACGTCCTTCTCTTTTTCTGTGATGT^ 

GGCTGAATACACAGCCAGGGAAGAAGGTATTTTA^ 

ATATACGGCCACCATGATGCAGTCCCTCCCTTG^ 

TTGGAGAGACTTTGGATTTGGGATGACGTGTCAGTATCGGTCAGACTTCATCAATA 

ATCTGGACATCAAAGGCTGGGGCGGAGAGGATGTGCACCTTTATCGCAAGTATCT 

ATAGTGGTACGGACGCCTGTGCGAGGACTCraCCACCTCTGGCATGAGAAGCG 

GACCCC CG AG CAGTACAAGATGTG CATGCAGTCCAAGGCCATGAACGAGGCATCCCACGGCCAGCTGG 
GCATGCTGGTGTTCAGGCACGAGATAGAGGCTCACCTT CG CAAACAGAAACAGAAGACAAGTAGCAAA 
AAAACATGAACTCCCAGAGAAGGATTGTGGGAGACACTTTT 



AGCCTCCGATTTCTCTCTGTTGGGCTTTTTACAACAGAAATCAAAA 

AACCC^GTTGCACCCTGTGAAGTGTCTXSA^ 

TGGAGGTTTTGATGGTGTTTAQ^ATACACTGAGACXrrc 

ATTTAAGAGCAGTTTTGTAAAAAATTCATTAGCATGAAAGGCAAGCATATTTCTCCT 
GCCTATCAGCAGGGCTCTAGTTTCTAGGAA^ 

TTATGATACTAGTGAGTACATTAAGTAAAATAAAATGGACCAGAAAAGAA^ 
TGTCATATTTTCCCCAAGATTAACCAAAAATAATCTC 

nrcv w r I " l " r i 'tr i ' l ^TATTTAAAAATGCACTTTTTTTCCCTTGTGAGTTATAGTCTGCTTATTTA ATTA 

CCACTTTGCAAGCCTTACAAGAGAGCACAAGTTGGC 

GAGATGCATTATGAGAACTTTCAGTTCAAAGCA^ 

CTGATTCTGTCAGGCACTGAATGTCAGGCATTGAGACATAG^ 

CGTACAGATACTTTCTCTGAAGAGTATTTTCGAAGAG^^ 

ACACTTTCTGCTTTACAGAAAAGGAAACTCATTGAGACTG^ 

AAACCACATTTTCTCCTCAGAAGTAGGGACCGCTTTCTTAC 

GTGAACCAAACAATCTCTTTTCAAAACAGGGTGCTCC CTGGCTTCTGGCTTCCATAAGAAGAAATG 

GAGAAAAATATATATATATATATATATATTGTGAAAGATCAAT CCATCTGC CAGAATCTAGTGGGATG 

GAAGTTTTTGCTACATGTTATCCACCCCAGGCCAGGT 

AGTTCTACTCAATCACCAAGATGCTTCTGAAAATTGCATTTTA 

AATAAATACAGTTAACATAGAGTGGTTTCITCATTCATGTGAA 

GAGCTAATTATCTCTTTGAGTCCTTGCTTCTGTTTC 

GAAGATTCAAGCTGTTGGTGTGTTAAAAAATGCATTGTATTGATT^ 

AATTAAAACACAGGCXATGAATGGAAGGTGGTATTC 



FIGURE 276

MMMVRRGLLAWISRVVVLLVLLCCAISVLYM^ 

EWEEQHRNYVSSLKRQIAQLKEELQERSEQLRNGQYQASDi^GLGIJJRSPPEKTQADIilAFL 
HSQVDKAEVNAGVKIATEYAAVPFDSFTLQKVYQLETGLTRHPEEKPVRKDKRDELVEAIES 
ALETLNNPAENSPNHRPYTASDFIEGIYRTERDKGTLYELTFKGDHKHEFKRLILFRPFSPI 
MKVKNEKLNMANTLINVIVPIAKRVDKFRQFMQNF^ 

GILENTSKAANFRNPTPIQLNGEFSRGKGLDVGARF^GSNVLLFFCTVDIYF^ 
LNTQPGKKVFYPVLFSQYNPGI I YGHHDAVP PLEQQLVI KKETGFWRDFGFGMTCQYRSDFI 
NIGGFDLDIKGWGGEDVHLYRKYLHSNLIVVRTPVRGLFHLWHEKRCMDELTPEQYKMCMQS 

KAMNEASHGQIX5MLVFRHEIEAHLRKQKQKTSSKKT 




FIGURE 277 

GAAAGAATQTTGTGGCTGCTCTTTTTTCTGGTGACTGCCATTCATGCT6AACTCT 
AGGTGCAGAAAATGCTTTTAAAGTGAGACTTAGTATCAGAACAG 

atgcctgggataccaatgaagaatacctctto^gcgatggtagctttctcc^tgagaaaa 

gttcccaacagagaagcaacagaaatttcc<^tgtcotactttgcaatgtaacccagagggt 

atcattctggtttgtggttacagacccttcaaaaaatcacac<x:ttcct 

aatcagccataagaatgaacaagaaccxsgatcaa 

ctggaatttttaaaaatcccttcca<^cttg 

gattattatatttggtgtgatattttgcatcatcatagttgc^ 

cagggatctggcaacgtagaagaaagaaow^gaaccatctgaagtggatgacgck3aagat 

aagtgtgaaaacatgatcacaattgaaaatggcatcccctc^^ 

gggcatattaatgatgccttcasgacagaggatgagaggctcacccctctcrgaagggctgt 

tgttctgcttcctcaagaaattaaacatttctttctgtgtgactgcrga 

ccaagagcagatcatatattttgtttcaccattcitcttttgtaat 

tgaaagtgaaaagcaatcaattatacccaccaacaccactgaaatcataagctattcacgac 

tcaaaatattctaaaatatttttctgacagtatagtgtataaatgtggtcatgtggtatttg 

tagttattgatttaagcatttttagaaataagatcaggcatatgtatatattttca<^cttc 

aaagacctaaggaa aaataa a ttttccagtggagaatacatataatatggtgtagaaatcat 

tgaaaatggatcctttttgacgatcacttatatcactctgtatatgactaagtaaacaaaag 

tgagaagtaattattgtaaatggatggataaaaatggaattacrcatatacagggtggaatt 

ttatcctgttatcacaccaacagttgattatatattttctgaatatcagcccctaataggac 

aattctatttgttgaccatttctacaatttgtaaaagtccaatctgtgcraacttaataaag 

TAATAATCATCTCTTTTTAAAAAAAAAAAAAAAAAAAAAAAAAA 
FIGURE 278

MLWLLFFLVTAIHAELCQPGAENAFKVRLSIRTAI^DKAYAWDTNEEYLFKAMVAFSMRICVP 
NREATE I SHVLLCNVTQRVS FWFWTD P SKNHTLP AVEVQS A I RMNKNRI NNAF FLNDQTLE 
FLKIPSTIAPPMDPSVPIWII IFGVIFCI I IVAIALLILSGI WQRRRKNKEPSEVDDAEDKC 
ENMITIENGIPSDPLDMKGGILMMPS 




FIGURE 279 

AACTCAAACTCCTCTCTCTGGGAAAACGCGGTGCTTGCTCCT 

GTGTTGGAGCCCTCGGTCTGCCCCGT^^ 

ATGGCAAGAGCTCTACTCGTGCGGTGCTTCTTCTCCT^ 

CCTATAGCAGCTGTGGAAATTTATACCT 

TCGGTTAAAATGCACTTTCTCCAGCTTTGCCCCTGTGGGTGA 

ATTTTTOTCCTCTAGACGGGGGACCTGAGCAGTTTGTATTCTACT 

CAACCCATGAGTGGGCGGTTTAAGGACCGGGTGTCTTGGG 

TGCCTCCATCCTTCTCrGGAAACTGCAGTTCGACGACAATGGGA 

AGAACCCACCTGATGTTGATGGGGTCATAGG 

CGCTTCTCTGAGATCCACTTCCTGGCTCTGGCCATTGGCTCTGCCT 

AATAGTAATTGTAGTGGTCCTCTTCCAGCAT^^ 

ATAAAGTGGTGGAGATAAAATCAAAAGAAGAGGAAAGGCTCAACCA^ 

GTTTATTTAGAAGACACAGAOT^CAATTTTAGATGGAAGCTGAGATGAT 

GAACCCTAGTATTTCTTGAAGTTAATGGAAACTTTTCTTT^ 

TTTCCAACCAGTTCTGCAGCATATTAGATTCTAGACAAGCAACA 

AGTGCTCCTCCATATCACCAGTCATAGACAGCCT 

GTGTAAATTTTTTCAAGTGCTCATTAGGTTTTATAAACAAGAAG^ 

GACACTACTTACAGTGTTATGACTTGTATACACATATATTGGTATCAA^ 

AATTTGTCTGTTACATTTCCTTTCA^ 

ATGTGTTTACTCTCTTTCCTTCCCACATTCTC^ 

TTTCTGATTAACAGTAAATCCTAAATTCAAA 

TCCTTAACTATGAGACACATCTTGTTTTACT 

TTTGTCG 



FIGURE 280

MYGKS STRAVLLLLG I QLTALWP I AAVE I YTSRVLEAVNGTDARLKCT FS S FAPVGDALTVT 
V^RPIJXX3PEQFVFYYHIDPFQPMSGRFKDRVSWDC^PERYDASIIiWKI^FDDNGTYTCQ 

VKNPPDVDGVIGEIRLSVVHTVRFSEIHFLALAIGSACALMI 1 1 VI VWLFQHYRKKRWAER 

AHKWE I KSKEEERLNQEKKVSVYLEDTD 




FIGURE 281 

GGATTTTTGTCTGTGCTCCCTGATCT^ 
ACTCTITGGGAGTTTCCATCTTT 

ACACGTATCCAGCTACTGGTCCTGCTGATGATGAAGCCCCTGATC 

GCAACCACTGCGACCACTGCTGCtCCTACCACTC 

TAAAGACATTCCAGTTTTACCCAAATGGGTT 

GAGATGGAATCAGCTTGAGTCTTCTGCAATTGGTCACAACT 

ATCCAACTACTTACCTTGCCTACXjATATCCCCTTTATCT 

ATAAAAAATAACTATGAGCAACATAAAAAAAAAAAAA 



FIGURE 282



MKFLAVLVLLGVSIFLVSAQNPTTAAPADTYPATGPADDE1APDAETTAAATTATTAAPTTAT
TAASTTARKDIPVLPKWVGDLPNGRVCP



FIGURE 283

GGACTCTGAAGGTCCCAAGCAGCTGCTC 

CTAGGGGTCTGGATTTGCTGGTTAACAAGA 

TACCTCCTGCCCTTCCACCTGCCCTGGTGTTC^ 

GTGTCCTGGGTCAGGGACGOfcGAGGATC 

ACTTGGCAAGGTCCAGCGATGGTCCGGAGTC 

GACA6TTCTGTTGTGCTTTCTTGGACAG 

GCAAACTCCATAAAGAACCAGAGGGTCTGGGCCCCX3GCCA 

CTGCTGCTGGCCAGTGGGAGTGGCACGAGGTGGGGCTTTGTG^ 

ATTTGCCTGCGGGCCATGGTCCCTGTCTAGOT 

CCCCAAAGAGCTTTCATTGTATCTATTGATT^ 

GGGCCGGGCACGGTGGCTCACGCCTGTAATCCC^ 

CACCTGAGATCAGGAGTTCAAGACCAGCCrGGCCAACATGGTGA 

TACAAAAAATTA<^CAGGCACAGTGGTGTGCACTGGTAGT€CCA<3T^ 

GCAGGAAAAT03CTTGAACCCAGGAGGCGGACGTTGCGGTGAGCOT 

TCCAGCCTGGGCGACAAGAGTGAGACTCCATCTCACACA 




FIGURE 284 

MLPPALPPALVFTVAWSLLAERVSWVRDAEDA^ 

GGQFCCAWLDSKRVLASPGWGAANSIKNQRVWAPATESSAQLLCCWPVGVARGGALCQ



FIGURE 285

GTCATC^CAGTGCCTGCTCTGTGCCTGCTCTGGGCCC 

GCGGCCCCCA.TGGGCGGCCCAGAACTGGCACAGCATGAGGAGCTGACCCTGCTCTTCCATGG 

GACCCTGCAGCTGGGCCAGGCCCTCAACX^TGTGTACAGGACCACGGAGG 
AGGCCAGGAACAGCCTGGGTCTCrATGGCCGCACAATAGAACTCCTGGGGCAGGAGGTCAGC 

CGGGGCCGGGATGCAGCCCAGGAACTTCGGGCAAGC<^ 

TATTCTGCAGCTGCAGGCAGAGGCCACAGCTGAGGTGCTGGGGGAGGTGGCCCAGGCACAGA 
AGGTGCTACGGGACAGCGTGC^GCGGCTAGAAGTa^^ 

GCCTACCGAGAATTTGAGGTCTTAAAGGCTCACGCTGACAAGCAGAGCCACATCCTATGGGC 

CCTCACAGGCCACGTGCAGCGGCAGAGGCGGGAGATGGTGGCACAGCAGCATCGGCTGC 

AGATCC^GGAGAGACTCC^CA.C^GCGGCGCTCCCAGCCTGAATCTGCCT 

GACCAATCATGCTGCAAGGAACACTTCCACGCC^CSTGAGGCCC<TGTGCAGGGAGGAGCTG 

CCTGTTCACTGGGATCAGCCAGGGGGCCGGGGCCCACTTCTG^ 

GCAGGCGGGGACAAAGGCAGAGGATGTAGCCCCATTGGGGAGGGGTGGAGGAAGGACATGTA 
CCCTTTCATGCCTACACACCCCTCATTAAAGCAGAGTCX3TGGCATTTCAAAAAAAAAAAAAA 

AAAAAAAAAAAAAAAAAAAAAAAA 



FIGURE 286



MPVPALCLLWALAMVTRPASAAPMGGPELAQHEELTLLFHGTLQLGQALNGVYRTTEGRLTK 
AIWSLGLYGRTIELIX3QEVSRGRDAAQELRASLLETQMEEDILQLQAEATAEVLGEVAQAQK 
VLJIDSVQRLEVQLRSAWLGPAYREFEVLKAHADKQSHILWALTGHVQRQRREMVAQQHRLRQ 

IQERLHTAALPA 




FIGURE 287 

GGCAACATGGCTCAGCAGGCITGCCCCA 

CCTGGTGATCACCTTACTCCTGGACCAGAC 

AGCACAGCAAACGTCGAGTGAGAGACAAGGATGGAGA 

TGGACAGAAGTCAATGCCTTGAAGGAAATTCAAGCCCTGCAGACAGT^ 

TAAAGTTCACAAGAAATGCTACCTTGCTTCAGAAGGTTTGAAGCA 

AAGACTGCATTTCCAAAGGAGGAATCCTGGTTATC 

CTCCAAGACTATGGTAAAAGGAGCCTGC^ 

CATGGTCACGGAAGGCAAGTTTGTTGACGTCAACGGAATCGCTATCTC 

ACCGTGCACAGCCTAACGGTGGCAAGCGAGAAAACTGTGTCCTGTTCTCCCAA 

GGCAAGTGGAGTGATGAGGCCTCTCGCAGCAGCAAGAGATACATATC 

TAA ATAGG TCTTTCTCCAATGTGTCCrCCAAGCAAGATTCATCATAACTTA 

TCTCTAAGATCAAGTAAAAATCATAATTTTTACTTATTAAAAAATT^ 

GTCCATAGCAATATGATAGCATCAGCCAATTTTGCTAAC^ 

TCCTGGGGTATAGGGGATCAGAAATATTGATC 

AAACAGACTAAAATCETTCTCT^ 

AAATCACAGTAGCAATGCAACTCATCACTCTAGA 

TTCCCTTGGAAGTTTAGCGTATGTTTGACTAACAAAAATTCCCTACATCA<3AGACTCTAGGT 



GCTATATAATCC^AAAACTTTTCAGCCTOTTGCTCATTCTC 

TTGTCAGCCCATTACCCTTATTTTGAATTGCT 

GCCATATCAGAACACAAACCCCTGAAGAGGTTCTGATC 

TACCCTTTTTTTGGAAGTTTCCAGCCGCAATTTGAAATGAAATGACAAGGTGTATATTTGAT 

CAATTTTCATTCCCACCATTGCATTACAACCTCTAACTTAAATG^ 

CAAAGAAGCAGATTGCATGATAAACXMAAATAGAAAAAAAGAACCTA 

AGCATCCTTACTCTCACCTTTTATGAGATTGAGAGTGGACT 

TCGTATATTTATTTTTTT^ 

TGGAAGCTGAAAACTGAATTTAAAGAATGCTATCTTGGAAAATTGCATACGTOT 

TTTTATTCTGCCTAGTGCTATTCTGCITGTTTAACTAGATTGTACA 

TAATATCAAATTACAAAGTTTAGACTTGGAGGGAAATGGGCTTTT^ 

AAATATATTTTGTTCTTCAAATAAATAGTGTTTAAAC^ 

CCCACTTT<XIAAACTTTAACTACACATGCT^ 

ATAATAAAGCCTGAATTCTC^TCAATAAAAAAAAAAAAAAAAAAAAAAAAAAAAA 



FIGURE 288



MAQQACPRAMAKNGLVICILVITLLLDQTTSHTSRLKARKHSKRRVRDKDGDLKTO 
EWALKEIQALQTVCLRGTKVHKKCYLASEGLKHFHEANEDCISKGGILVIPRNSDEINALQ 

DYGKRSLPGVITOFWjGINDMVTEGKFVDWGIAISFLNWDRAQPNGGKRENCV^ 
WSDEACRSSKRYICEFTIPK 



FIGURE 289

GCGAGGACCGGGTATAAGAAGCCTCGTGGCCTT^ 

CCCGAGCCCCCGCGCCATOAAGCTCGCCGCCCTCCTGGGGCTCT^ 

GCTCOSCTGCTGCrTTCTTAG^^ 

GAGTCGGCGGCGGAGGCCGGGGCCGGGACCCTGGCCAACCCCCTCGGCACCOT 

GAAGCTCCTGCTGAGCAGCCTGGGCATCCCCGTGAACCACCT 

GTGTGGCTGAGCTGGGTCCCCAGGCCG^ 

GCCCTGACAGTGTTTGGCT^GCCGAGACTGGAGCATCTACACCTGAGGACA^ 
CACCCGCGAGGGCTGAAAACCCCGCCGCGGGGAGGACCGT 
OTCAATAAACGTGGTTAAGAGCAAAAAAAAAAA7VAAAAAAAAAAAAAAAA 
AAAAAAAAAAAA 



FIGURE 290

MKLAALLGLCVALSCSSAAAFLVGSAKPVA^ 
SLGIPVNHLIEGSQKCVAELGPQAVGAVKALKALLGALTVFG 



FIGURE 291

TGAAGGACTTTTCCAGGACCCAAGGCCACACA^ 

CCTTGGCCTCCGCAGCCGATCACMSAAGGTGGTGCCAAGTCTCCTGCTCTCCX3TCCTCCTG 

GGACAGGTGTGGCTGGTAOICGGCTTGG CCCCCAGT CCTCAGTCGCCAGAGACCCCAGCC CC 
TCAGAACCAGACCAGOVGGGTAGTGCAGGCTCCCAGGGAGGAAGAGGAAGATGAGCAGGAGG 
CCAGCGAGGAGAAGGCCGGTGAGGAAGAGAAAGCCTGGCTGATGGCCAGC^GGCAGCAGCTT 

GCCAAGGAGACTTCAAACTTOGGRTTCAGCCTGCTGOGAAAGATCT 

CAACATGGTCTTCTCTCCATTTGGCATGTCCTTGGCCATGACAGGCTTGA 

C^GGGCCGACTGAAACCC^GATCAAGAGAGGGCrCCACTTGCAGGCCCTGAAGC^ 

CCCGGGCTCCTGCCTTCCCTCTTTAAGGGACTCAGAGAGACCCTCTCCCXK!AACCT GGAAC T 

GGGCCTCTCACAGGGGAGTTTTGCCTTCATCCACAAGGATTTTGATGTCAAAGAG^ 

TCAATTTATCCAAGAGGTATTTTGATACAGAGTGCGTGCCTATGAATTTTOGCAATGCCTCA 

CAGGCCAAAAGGCTCATGAATCATTACATTAAC^AAGAGACT 

GTTTGATGAGATTAATCCTGAAACCAAATTAATTCTTC 

AATGGTTGACCCCATTTGACCCTGTCTTCACCGAAGTGGACACTTTCCACCTGGACyAGTAC 

AAGACCATTAAGGTGCCCATGATGTACGGTGCAGGCAAGTTTGCCTCCACCT 

TTTTCGTTGTCATGTCCTCAAACTGCCCTACCAAGGAAATGCCACCATGCTGGTGGTCCTCA 

TGGAGAAAATGGGTGACC^CCTCGCCCTTGAAGACTACCTGACC^CAGACTTGGTGGAGACA 

TGGCTCAGAAACATGAAAACCAGAAACATGGAAGTTTTCTTTCCGAAGTTCAAGCT 

GAAGTATGAGATGC^TGAGCTGCTTAGGC^GATGGGAATC^GAAGAATCTTCTCACCCTTTG 

CTGACCTTAGTGAACTCTCAGCTACTGGAAGAAATCTCCAAGTATCCAGGGTTTTACGAAGA 

ACAGTG ATTGAAGTTGATGAAAGGGGCACTGAGG CAGTGG CAGGAATCTTGTCAGAAATTAC 

TGCTTATTCCATGCCTCCTGTCATCAAAGTGGACCGGC^ 

AAACCTCI^AATGCITCTGTTTCTGGG 

ACATGCATAAGCACTTCGTGCTGTAGTAGATGCTGAATCTGAGGTATCAAACACACACAGGA 

Tfi^rft^fi^Tr^&Tf^OAGGGGAGAGTGTTCCTTTTGTTCTTAACTAGTTTAGGGTC 

AAATAAATACAGTAGTCCCCACTTATCTGAGGGGGATACATTCAAAGACCCCCAGCAGATGC 

CTGAAACGGTGGACAGTGCTGAACCTTATATATATTTTTTCCTACACATACATACCTATGAT 

AAAGTTTAATTTATAAATTAGGCACAGTAAGAGATTAACAATAATAACAACATTAAGTAAAA 

TGAGTTACTTGAACGCAAGCACTGCAATACCATAACAGTCAAACTGATTATAGAGAAGGCTA 

CTAAGTGACTC^TGGGCGAGGAGCATAGACAGTGTGGAGACATTGGGCAAGGGGAGAATTCA 

CATCCTtSGGTGGGACAGAGCAGGACGATGCAAGAT^ 

TGCTTAAGACTTTTAGATTGTTTATTTCTGGAATTTTTC^TTTAATGTTTTTGGAC 

TGACCATGGTTAACTGAGACTGCAGAAAGCAAAACCATGGATAAGGGAGGACTACTACAAAA 

GCATTAAATTGATACATATTTTTTAAAAAAAAAAAAAAAAAAA 



FIGURE 292

MKWPSLLLSVLLAQVWLVPGIAPSPQSPBTPAPQNQTSRVVQAPREBEEDEQEASEEKAGE 
EEKAWLMASRQQLAKETSNFGFSLLRKISMRHIXaN^ 

KRGLHLQALKPTKPGLLPSLFKGLRETLSRNLELGLSQGSFAFIHKDFDVKETFFNLSKRYF 
DTECVPMNFRNASQAKRLMNHY INKETRGKI PKLFDE INPETKL ILVDYI LFKGKWLTPFDP 
VPTEVDTFHLDKYKTI KVPMMYQAGKFASTFDKNFRCHVLlQiPYQ®^TMLVVIMBKMGDHL 
ALEDYLTTDLVETWLRMMICrRNMEWFPKFKIJXJKYEMHEI^ 

TGRNLQVSRVLRRTVIEVDERGTEAVAGILSEITAYSMPPVIKVDRPPHPMIYEETSGMLLP 



LGRWNPTLL 




FIGURE 293 

CTGGGATCAGCCACTCCAGCTCCCTGAGCACTCTCTACAGAG 

GAGGCTCCTCCTGGTCACCAGCCTGGTGGTTGTC 

CACCCAA0K3TCCCTATCAAGAT 

GCCTGGGGCGCCCGTGTGGTGGAGCCTCCG^^ 

TGTCCAGAAGCCGAAACTCTTGACCACCG^ 

TCCTTCCAGGCACCAAGGCCTGGATGGAG^^ 

GAGCCCGACCATGACAGCCTGTACCACCCrC^ 

CCGGTTGTGGGTGATGCCAAATCACCAGGTGCTCCTGGGACCGGA<3^ 
TCTACCACCCCCAGTAGGGCTCC^ 
CTGTTGGGACTGGGACCCTCCCTACCCTGCCX^ 
AAAAAAAAAAAAAAAA 



FIGURE 294

MRRI^VTSLVWLLWEAGAVPA^ 

FPVQKPKLLTTEEKPRGQGRGPILPGTKAWMETEDTLGRVLSPEPDHDSLYHPPPEEDQGEE 
RPRLWVMPNHQVLLGPEEDQDH IYHPQ 




FIGURE 295 

AGAAAGCT<3CACTCT<3TTOAGCTC^ 

TACCCAAGGAAAGTGCAGCrGAGACTCAGACAAGATTACAa^AACC^CTCAGCr^ 
TGTTTCTCATAGCGACCACCAGAGGATGGAGTACAGATGAGGCTAATACTTACTTCAAG 
TGGACCTGTTCTTCGTCTCCATCTCTGCCCAGAAGCTGCAAGGAAATCAAAGACGAATGTCC 
TAGTGCATTTGATGGCCTGTATTTTCTCCGCACTGAGAAT^ 

GTGACATGACCTCTGGGGGTGGCGGCTGGACCCTGGTGGCCAGOGTGCATGAGAATGACATG 

CGTGGGAAGTGCACGGTGGGCGATCGCTGGTCCAGTCAGCAGGGCAGCAAAGCAGACT 

AGAGGGGGAOSGCAACTGGGCCAACTACAACACCTTTGGATC^ 

ATGACTACAAGAACCCTGGCTACTACGACATCCAGGCCAAGGACCTGGGCATCTGGCAGGTG 
CCCAATAAGTCCCCCATGC^GCACTGGAGAAACAGCTCCCTGCTGAGGTACOSCACGGACAC 
TGGCTTCCTCCAGAGACTGGGACATAATCTGTTTGGCATCTACCAGAAATATCCAGTGAAAT 
ATGGAGAAGGAAAGTGTTGGACTGAOU^CGGCCCGGTGATCCCTGTGGTCTATGATTTTGGC 
GACGCCCAGAAAACAGCATCTTATTACTCACCCTATGGCCAGCGGGAATTCACTGCGGGATT 
TGTTCAGTTCAGGGTATTTAATAACGAGAGAGCAGCCAACGCCTTGTGTGCTGGAATGAGGG 
TCACCGGATGTAACACTGAGCATCACTGCATTGGTGGAGGAGGAT^ 

CCCCAGCAGTGTGGAGATTTTTCTGGTTTTGATTGGAGTGGATATGGAACTCATGTTGGTTA 
CAG(^GCAGCCGTGAGATAACTGAGGCAGCTGTGCTTCTATTCTATCX3T5^GAGTTTTGTG 
GGAGGGAACGCAGACCTCTCCrCCCAACCATGAGATCCCAAGGATGGAGAACAACTTACCCA 
GTAGCTAGAATGTTAATGGCAGAAGAGAAAACAATAAATGATATTGACIXZAAGAAAAAAA 



FIGURE 296

MNQLSFLLFLIATTRGWSTDEANTYFKEWTCSSSPSLPRSCKEIKDECPSAFDGLYFLRTEN 
GVIYQTFCDMTSGGGGWTIiVASVHENDMRGKCTVGDRWSSQQGSKADYPEGDGNWANYNTFG 
SAEAATSDDYKNPG YYD I QAKDLGIWHVPNKSPMQHWRNSSLLRYRTDTGFLQTLGHNLFG I 
YQKYPVKYGEGKCWTDNGPVIPVVYDFGDAQKTASYYSPYGQREFTAGPVQFRVFNNERA^ 
ALCAGMRVTGCNTEHHCIGGGGYFPEASPQQC^ 



FIGURE 297

GCGGAGCCGGCGCC«3CK3CGCAGA 

CACGAGGCTGCCGCATCCTGCCCTCGGAACA^gQGACTCGGCGCGCGAGGTGCTTGGGGCG 
CGCTGCTCCTGGGGAaSCTGCAGGTGCTAGCGCTGCTGGGGGCGGCCCATGAAAGCGCAGCC 

MGGCX3GCATCTGCAAAGATAGAGAATTCTGGGCTTCCA 
AGAGACTCTCCAACATGTGCCTTCrKSACCATACAAATGAAACTTC 
CACCAACTTCAGTTGCCTCAGACTCCAGTAATACAAC 
GCATCTAATACAACAACACGAGGGATGGTCTCAACA^ 

TACACCCAAAACAACAAGTGTTTCACAGAACACATCTCAGATATCAACATCC!ACAA 
TAACCCACAATAGTTCAGTGACATCTGCTGCTTCATCAGTAACAATCACAACAACTATGC^ 

TCTCAAGCAAAGAAAGGATCAAAATTTGATACTGGGAGCTTTGTTOGTTC 

GCTGGGAGTTTTATCTATTCTTTACATTGGATGCAAAATGTATTACra 

GGTATCGAACCATAGATCAACATGATGCC^TC^^ 

ATACAGATTGATGCTGCCCTATCAATTAATTTTGGTTTATTAATAGTTTAAAACAATATTCT 
CTTTTTGAAAATAGTATAAACAGGCCATGCATATAATGTACAGTGTATTACGTAAATATGTA 
AAGATTCTTCAAGGTAACAAGGGTTTGGGTTTTGAAATAAACATCT<^ATCTTATAGAGCGT 

TCATACAATGGTTTTAGCAAGTTCATAGTAAGA 
CCCCTGGGGGC^TTyTT^A^aTaTOACCAGTAATTGAAAGACCTCATCACTGAAAGACAGAA 

TGCCATCTGGGCATACAAATAAGAAGTTTGTCACAGCACTC^GGATTTTGGGTATCTTTTGT 

AGCTCAGATAAAGAACTTCAGTGCTTTTCAGAGCTGGATATATCTTAATTACTAATGCCACA 

CAGAAATTATACAATCAAACTAGATCTGAAGCATAATTTAAGAAAAACATCAACATTTTTTG 

TGCTTTAAACTGTAGTAGTTGGTCTAGAAACAAAATACTCC 




FIGURE 298 

MGLGARGAWAALLLGTLQVIALLGAAHESAAMAASANIENSGLPHNSS^ 

TNETSNSTVKPPTSVASDSSNTTVTTMKPTAASNTTTPGMVSTNMTSTTLKSTPKTTSVSQN 

TSQISTSTMTVTHNSSVTSAASSVTITTTMHSEAKKGSKPDT6SFVGGIVLTLGVLSILYIG 

CKMYYSRRGIRYRTIDEHDAI I 



FIGURE 299

c^gccgggtcccaagcctgtgcctgagcctc 

tcgcgggggctccgggctgtgggaccgct^ 

cttcttcggcitggctccttgct 

gcagctgtcagacgccgccaagaatttcgaggatgtcagatgtaaa 
ataaagaaaattctgggcatatttat 

catgttgtggagcccatgcctgtgcgggggcctgatgtagaagcatactgtcta 

ATGGAAATATGAAGAAAGAAGCTCTGTCACAATCAAGGT^ 

TTTTGGGCCTTCTACTTCTGTACATGGTATA^ 

CGCCTCTTTGGACATGCACAGTTGATACAGAGTGATGATGA 

TGCAAATGCACACGATGTGCTAGCCCGCTCCCGC^ 

AATATGCACAGCAGTOCTGGAAGCTTCA^ 

CATGTTGTCCTCAGCJ^TTGGGAATTGAATTCAAGGTGACT 

CTGGAAAGAACTGACTGGGTTTTGCTGGGTTTCATTTT 

GTTGCTGGAAGATTCAAAACTGGAAGOUVAAACTTGCT 

ATAATAGAGACATTTTTAAAAGCAC^ 

TGA<mTTACTAATAAAAATAAATClK5GCTGTAAATTATCTTGAAG 

AGCACT^TCTTTT^^ 

TTGTTGTTGTTTTTTGTTTKOT 

AACAACTTTTTTCAAGTCACTTTACTAAACA^ 

TCGAGTTTCATTTATATTTTGCAGTGTAGCCA^ 

ACTTTTGCACTGACTGTATTATCTGGGTATCTC 

CTAAAATGCCTGGTGGCTTTTCACAAAAAGCAGATTT^ 

CAATGCATCCTAGAACAAACTGGCX^TTTGCTAGTTTACTCTAAAG 

GTGTGTGTGGTCTTACTCATCTTCTAGTACCTTTAAGGAC^ 

TGCAATAAAGAAATTTTATTTTAAAOCCAA 

TCAGCATTTCCGGTCGTGGTGAGAGGCAGCTGTT^^ 

AGGGCTGGGGTTCTGGGTGCCTCTTCTGAAAGGTCTAACCATTATTGG 

TCTTCCTATGTCCTCTTTGGAATGTAACAATAAAAAT^ 



FIGURE 300

^TLWGGLI^GSI^SLSCLAI^VIiI^ 
QKDCDCLHVVEPMPWGPDVEAYCLRCTCKYEER^ 

VEPILKRRLFGHAQLIQSDDDIGDHQPFANAHDVL2MRSRSRANVLNKVEYAQQRWKLQVQEQ 
RKSVFDRHWLS 



FIGURE 301

GC^CCTGCGACCACCGTGAGO^TCA^GOGTACTCCACAGTGCAQAGAGTCGCTCTGOC^ 

CTGGGCTTGTCCTGGCTCTGTCGCTGCTGCTGCCCAAGGCCITCCTGTC 

C^GGAGCCGCCGCCGACACCTGAAGGAAAATTGGGCCGATTTC 

CCAGGCACCCTCAGATGGCCAGACTCCTGGGGCTCGT^ 

CATTTGCAAAGGCCAAAGGATCAGGTGGAGGTGCTGGAGGAGGAGGra 

ATGGGGCAGATTATTCCAATCTACGGTTTTGGGATTTTTTTATATA^ 

TAAGGTAAGTAGAATCATCCTAATCATATTACATCAASGj^AAATCTAATATGGCGATAAAAA 

TC^TTGTCTACATTAAAACTTCTTATAGTTC^^ 

AATCCTGCCTCCTCTTCATGAGGTACTTAGGATAGCCATTATTTCAGTTTCACATAAGAATG 

TTTACTCAATGTTTAAGTGTTTTGCCCO^AATTCACAACT 

GAACATGGATCTTTTGGTTCTTAATCCAGTGAGTGATAC^ 

FIGURE 302

MAYSTVQRVAIJ^GLVIALSI^PKAFLSR 

PGARFQRSHLAEAFAKAKGSGGGAGGGGSGRGLMGQIIPIYGFGIFLYILYILFKVSRIILI 
ILHQ 

FIGURE 303

CGGCTCGAGTGCAGCTGTGGGGAGATTTCAGTGCATTGCCTCCCCTGQGTGCTCTTC!ATCTT 
g GATTT ^A a a r.TTY;Ar,RGCAGCATGTTTrGC€CACTGAAACTCATCCTGCTGCCAGTGTTAC 
TGGATTATTCCTTGGGCCTGAATGACTTGAATGTTTC^ 
GGTGATTCAGCTCTGATGGGATGTGTTTTC<^GAGCAC^ 

AGACTGGACTCTGT(^CCAGGAGAGCACGCCAAGGACGAATATGTGCTATACTATTACTC<^ 

ATCTCAGTGTGCCTATTGGGCGCTTCC^GAACCGCGTACACTTG^ 
AATGATGGCTCTCTCCTGCTCCAAGATGTGCAAGAGGCTGACCAGGGAACCTATATCTC 

AATCCGCCTCAAAGGGGAGAGCCAGGTGTTCAAGAAGGCGGTGGTACTGC^ 
AGGAGCCCAAAGAGCTCATGGTCCATGTGGGTGGATTGATTCAGATGGGATGTGTTTTCCIAG 

AGCACAGAAGTGAAACACGTGACCAAGGTAGAATGGATA 

GGAGATTGTATTTCGTTACTACCACAAACTCAGGATGTCTGTGGAGTACTCCCAGAGCTGGG 

GCCACTTCCAGAATCX3TGTGAACCTGGTGGGGGACATTTTCCGCAATGACIGGTTC 

CTTCAAGGAGTGAGGGAGTCAGATGGAGGAAACTACACCTGC^GTAT^CACCTAGGGAACCT 

GGTGTTCAAGAAAACCATTGTGCTGCATGTCAGCCCGGAAGAGCCTCGAACACTGGTGACCC 

CGGCAGCCCTGAGGCCTCTGGTCTTGGGTGGTAATCAGTTGGTGATCATTGTGGGAATTGTC 

TGTGCCACAATCCTGCTGCTCCCTGTTCTGATATTGATCGTGAAGAAGACCTGTGGAAATAA 

GAGTTCAGTGAATTCTACAGTCTTGGTGAAGAACACGAAGAAGACTAATCK^GAGATAA 

AAAAACCCTGCCATTTTGAAAGATGTGAAGGGGAGAAACACATTTACTCCC<^^^ 

rG ^ R ^TnATC^AGGAAGAAG AACCAAGTGAAAAAT<^GAGGCCACCrAC^ 

CCCAGTTTGGCCTTCTCTGAGGTC^GATCGGAAC^CTCACTTGAAAAAAAGTCAGGTGGGG 

GAATGCCAAAAACACAGCAAGCCTTTSGAGAAGAATGGAGAGTCCCTTCATCTGAG 
TGGAGACTCTCTCCTGTGTGTGTGCTGGGCC^CTCTACCAGTGATTTCAGACrcCCXSCTCTC 
CCAGCTGTCCTCCTGTCTCATTGTTTGGTCAATACACTGAAGATGGAGAATTT^ 
CAGAGAGACTGGACAGCTCTGGAGGAACAGGCCTGCTGAGGGGAGGGGAGCATGGACTTCGC 

CTCTGGAGTGGGACACTGGCCCTGGGAACCAGGCTGAGCTGA 

GGATCAGACCCTCCTGTGGGCAGGGTTCTTAGTGGATGAGTTACPGGGAAGAATCAGAGATA 
AAAACCAACCCAAATCAA 

FIGURE 304

MFCPLKLILLPVLLDYSLGLNDLNV^ 

EHAKDE YVLYYYSNLS VP I GRFQNRVHLMGD I LCNDGS LLLQDVQEADQGTYI CE I RLKGES 
QVFKKAVVLHVLPEEPKELMVHVGGLIQMGCWQSTEVKH^ 

HKLRMSVEYSQSWGHFQNRVNLVGD I FRNDGS IMLQGVRESDGGNYTCS IHLGNLVFKKTI V 
LHVSPEEPRTLVTPAALRPLVLGGNQLVI^ 

LVKNTKKTNPE I KEKPCHFERCEGEKH I YS P 1 1 VRE VI EEEE PSEKSEATYMTMHP VWPSLR 
SDRNNSLEKKSGGGMPKTQQAF 

FIGURE 305

CTATGAAGAAGCTTCCTGGAAAACAATAAGCAAAGG 

GTTCTACCCTACTAAAGACAGGAAGATCATAAACTGACAGATACTGAAATTGTAAGAGTTGG 

AAACTACATTTTGCAAAGTCATTGAACTCrGAGCTCAGTTGCAGTACT 

(^TGAAGATGGATACATCACCTTAAATATTAAAACT^ 

GCCCTGCATCCTCCTC<n^TGGCGTGTGATGGCTTTGATTCra 

ATGGTTGTCGGGCTGGTGGCTCTGGGGATTTGGTCTGTCATGCAGCGCyU\.TTACCT 

TGAGAATGAAAATCGCACAGGAACTCTGCAACAATTAGCAAAGOGCTTCTGTCAM 

TAAAACAATCAGAACTAAAGGGCACTTTCAAAGGTCATAAATGCAGCCCCTGTGACACAAAC 

TGGAGATATTATGGAGATAGCTGCTATGGGTTCTTCAGG 

TAAGCAGTACTGCACrGACATGAATGCTACT^ 

AGTACATCAAAGCCAGGACTCATTTAATTCX3TTGGGTCX3GATTA 

GAGGTCTGGAAGTGGGAGGATGGCTCGGTTATCTCAGAA 

TGGAAAAGGAAATATGAATTGTGCTTATTTTCATAATGGGAAA^ 

AGAACAAACATTATTTAATGTGTGAGAGGAAGGCTGGCATGACCAAGGTGGACCAACTACCT 
TAATGCAAAGAGGTGGA<^GGATAACACAGATAAGGGCTTTATTGTACAATAAAAGATATGT 

ATGAATGCATCAGTAGCTGAAAAAAAAAAAAAA 

FIGURE 306

MQDEDGYITLNIKTRKPALVSVGPASSSWVniVMALILLILCVGMWGLVALGIWSVMQRNYL 
QDENEl^TGTLQQIAKRFCQYVVKQSELKGTFKGHKCSPCDTNWRyYGDSCYGFFRHN^ 
ESKQYCTDMNATLLK I DNRNI VEY I KARTHL I RWVGLS RQKSNEVWKWEDGS VI SENMFEFL 
EDGKGNMNCAYFHNGKhMPTFCENKHYLMCERKAGMTKVDQLP 

Claims 1 and 5-11 relate to polynucleotides defined as having at least 
86% sequence identity to a nucleotide sequence that encodes a polypeptide 
comprising an amino acid sequence selected from a group of polypeptides 
listed in claim 1 (SEQ.IDs. 2, 6, 8, 14, 28 ). Back-translation of 
the polypeptide into DNA generates a very great number of nucleic acid 
sequences. It is not possible to search an entire database with this 
enormous set of sequences. The search has thus been limited to nucleic 
acid sequences having at least 80% homology with the nucleotide sequences 
as listed in claim 2 (SEQ.IDs. 1, 5, 7, 13, 19 ).
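
The scale of the back-translation problem can be illustrated with a short calculation. The sketch below is not part of the search procedure; it assumes the standard genetic code and counts only DNA sequences that encode a peptide exactly, so it understates the set covered by a percent-identity claim. The names `CODON_DEGENERACY` and `count_back_translations` are hypothetical, chosen for illustration.

```python
from math import prod

# Number of synonymous codons per amino acid in the standard genetic code
# (61 sense codons in total; stop codons are excluded).
CODON_DEGENERACY = {
    "A": 4, "R": 6, "N": 2, "D": 2, "C": 2, "Q": 2, "E": 2, "G": 4,
    "H": 2, "I": 3, "L": 6, "K": 2, "M": 1, "F": 2, "P": 4, "S": 6,
    "T": 4, "W": 1, "Y": 2, "V": 4,
}


def count_back_translations(peptide: str) -> int:
    """Return how many distinct DNA sequences encode `peptide` exactly."""
    return prod(CODON_DEGENERACY[aa] for aa in peptide.upper())


# First ten residues of the polypeptide shown in Figure 282.
example = "MKFLAVLVLL"
print(count_back_translations(example))  # 331776
```

Ten residues already give several hundred thousand coding sequences, and the count multiplies with every further residue, which is consistent with the statement that a full-length polypeptide cannot be searched by exhaustive back-translation.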



1. Claims: Invention 1: Claims 1-26 (all partially) 

A membrane-bound protein as represented by SEQ.ID.2 and 
variants having 89% amino acid sequence identity therewith; 
a nucleic acid encoding said protein as represented by 
SEQ.ID.1 and variants having 80% nucleotide sequence 
identity therewith; a vector comprising said nucleic acid; a 
host cell comprising said vector; a process for the 
production of said protein; a chimeric molecule comprising 
said protein; an antibody specific for said protein; an 
extracellular domain of said protein; and a variant of said 
protein lacking its signal sequence. 

2. Claims: Inventions 2-135: Claims 1-26 (all partially) 

Idem as subject 1 but limited to one DNA sequence selected 
from SEQ.IDs. 1-424 and the corresponding polypeptide, 
wherein invention 2 is limited to SEQ.IDs. 5 and 6, 
invention 3 is limited to SEQ.IDs. 7 and 8, invention 4 is 
limited to SEQ.IDs. 13 and 14 ..... and invention 135 is 
limited to SEQ.IDs. 423 and 424. 